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Abstract 


Finding  shortest  paths  and  determining  visibilities  are  problems  encountered  every 
day.  Formal  versions  of  these  problems  are  important  in  computational  geometry. 
Both  kinds  of  problems  specify  a  space  and  a  set  of  opaque,  impenetrable  obstacles  in 
the  space.  Visibility  problems  ask  what  an  observer  would  see  if  placed  in  the  space; 
shortest  path  problems  seek  the  minimum  length  route  for  an  object  moving  among 
the  obstacles.  This  thesis  provides  algorithms  for  several  visibility  and  shortest  path 
problems,  illuminating  the  relationship  between  the  two  classes. 

The  first  part  of  the  thesis  considers  problems  in  which  the  space  is  the  plane  and 
the  obstacles  are  non-intersecting  line  segments.  It  presents  a  worst-case-optimal 
algorithm  to  find  the  visibility  graph  of  the  set  of  segments;  that  is,  it  computes 
what  would  be  seen  by  observers  standing  at  all  the  segment  endpoints.  It  then 
uses  this  information  to  find  shortest  paths  for  a  non-rotating  convex  body  moving 
among  the  segments. 

The  second  part  of  the  thesis  provides  several  optimal  algorithms  for  shortest 
path  and  visibility  problems  inside  simple  polygons  that  have  already  been  trian¬ 
gulated.  In  this  setting,  the  polygon  walls  are  the  only  obstacles.  The  most  basic 
problem  considered  is  that  of  finding  all  shortest  paths  from  a  particular  vertex  to 
other  vertices.  The  solution  to  this  problem  can  be  applied  to  solve  several  visibility 
problems,  including  that  of  finding  the  visibility  graph  of  a  simple  polygon  in  time 
proportional  to  its  size.  The  thesis  concludes  by  presenting  geometric  structures 
and  corresponding  data  structiures  to  solve  several  query  problems  efiiciently,  in¬ 
cluding  the  following:  the  shooting  problem,  which  asks  where  a  bullet  fired  inside 
a  polygon  would  hit  its  boundary,  and  the  two-point  shortest  path  problem,  which 


111 


asks  for  the  length  of  the  shortest  path  between  two  arbitrary  query  points. 

Throughout  the  thesis,  solutions  to  shortest  path  problems  help  solve  visibility 
problems,  and  vice  versa.  The  use  of  a  single  technique  to  solve  problems  of  both 
classes  demonstrates  the  close  ties  between  the  two. 
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Preface 


All  of  the  work  described  in  this  thesis  has  been  or  will  be  published  separately. 
My  thanks  go  to  the  holders  of  the  copyrights  for  permission  to  use  the  material 
here.  In  accordance  with  their  request,  no  part  of  this  thesis  should  be  reproduced 
for  direct  commercial  zidvantage. 

A  version  of  Chapter  2  appeared  as  a  joint  paper  with  Takao  Asano,  Tetsuo 
Asano,  Leo  Guibas,  and  Hiroshi  Imai.  It  appeared  in  the  proceedings  of  the  1985 
IEEE  Symposium  on  Foxmdations  of  Computer  Science  [AAG*85]  and  was  pub¬ 
lished  in  revised  form  in  Algorithmica  [AAG*86].  The  joint  paper  was  a  result  of 
simultaneous  independent  discovery.  Leo  Guibas  and  I  found  an  O(n^)  visibility 
graph  algorithm  in  early  1985,  which  we  described  in  the  Bulletin  of  the  European 
Association  for  Theoretical  Computer  Science  [GH85].  The  other  three  authors 
achieved  the  same  result  by  similar  means  at  about  the  same  time,  so  we  published 
our  work  together.  The  prose  of  Chapter  2  is  my  (substantial)  revision  of  Imai’s 
first  draft. 

Chapter  3  is  mostly  my  own  work.  It  appeared  as  a  DEC/SRC  technical  re¬ 
port  coauthored  with  Leo  Guibas  [HG86],  and  will  be  published  in  the  Journal  of 
Algorithms. 

Chapter  4  is  another  case  of  simultaneous  discovery.  Leo  Guibas  and  I  found 
algorithms  for  all  the  problems  discussed  in  that  chapter  except  the  convex  rope 
problem  of  Section  4.1.1,  When  we  learned  that  Micha  Sharir,  Daniel  Leven,  and 
Robert  Tarjan  had  independently  solved  the  seime  problems,  we  combined  our  work 
with  theirs  for  publication.  The  resulting  paper  appeared  in  the  1986  ACM  Sympo¬ 
sium  on  Computational  Geometry  [GHL*86]  and  will  be  published  in  revised  form 


in  Algorithmica.  Most  of  the  words  in  Chapter  4  come  from  Sharir’s  draft  of  the 
paper.  The  algorithm  of  Appendix  A,  which  I  developed,  appeared  in  this  paper. 

The  algorithms  of  Chapter  5  are  my  own.  A  paper  describing  them  will  be 
published  in  the  1987  ACM  Symposium  on  Computational  Geometry  [Her87]. 

The  results  described  in  Chapter  6  will  also  appear  in  the  1987  ACM  Symposium 
on  Computational  Geometry  [GH87].  The  work  is  chiefly  my  own,  though  Leo 
Guibas  made  important  contributions. 
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Chapter  1 
Introduction 


Learn  everything;  you  will  see  afterwards  that  nothing  is  superfluous. 

—  Hugh  of  St.  Victor  (early  twelfth  century) 


Visibility  problems  and  shortest  path  problems  occiu:  frequently  in  everyday 
life.  We  often  want  to  know  what  is  visible  from  some  point,  say  a  comer  office 
window,  or  how  to  move  an  object,  say  a  case  of  Limburger  cheese,  from  one  place 
to  another  as  quickly  as  possible.  Related  problems  arise  in  computer  science  in 
application  areas  including  graphics  and  robotics.  In  graphics,  the  hidden  surface 
removal  problem  is  a  visibility  problem  of  fundamental  importance.  In  robotics, 
computer  vision  and  robot  motion  planning  pose  a  variety  of  visibility  and  shortest 
path  problems.  This  thesis  addresses  formal  versions  of  these  problems,  which  are 
important  in  computational  geometry. 

Both  types  of  problems  specify  a  space  and  set  of  opaque,  impenetrable  obstacles 
in  the  space.  Visibility  problems  then  give  a  position  or  set  of  positions  in  the 
space  and  ask  what  part  of  the  space  is  visible  from  those  positions.  Shortest 
path  'problems  specify  an  object  to  be  moved  and  its  initial  and  final  positions;  the 
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desired  solution  is  a  path  for  the  object  between  the  given  positions  that  avoids  the 
obstacles  and  has  minimum  length,  if  any  such  path  exists.  There  are  many  variants 
on  the  problems:  the  space  may  have  two,  three,  or  more  dimensions;  the  obstacles 
may  be  subject  to  constraints  (convexity,  for  example);  the  moving  object  may  have 
constraints  on  its  shape  or  motion;  and  any  of  several  definitions  of  visibility  and 
distance  may  be  used.  This  thesis  uses  Euclidean  length  as  the  distance  metric 
for  shortest  paths  and  straight-line  visibility  in  its  visibility  problems.  (A  precise 
definition  of  visibility  appears  below.) 

Even  the  simplest  shortest  path  problems  seem  to  be  very  difficult  in  three  or 
more  dimensions;  the  best  algorithms  known  for  a  point  moving  among  polyhedral 
obstacles  require  time  exponential  in  the  number  of  vertices  of  the  obstacles.  There¬ 
fore,  this  thesis  considers  shortest  path  problems  in  the  plane,  where  the  obstacles 
are  disjoint  simple  polygons  with  a  total  of  n  vertices.  The  polygons  are  not  al¬ 
lowed  to  intersect  because  intersections  effectively  increase  the  number  of  vertices. 
For  precise  characterization  of  algorithm  running  times,  it  is  important  to  have  all 
vertices  be  explicit. 

The  definition  of  visibility  reqtiires  some  care.  At  least  two  slightly  different 
definitions  are  in  current  use.  The  first  definition,  which  is  used  in  this  work,  says 
that  two  points  are  mutually  visible  if  the  open  segment  connecting  them  does  not 
intersect  any  obstacle.  This  is  a  natural  definition:  it  corresponds  to  what  we  mean 
by  visibility  in  daily  life.  We  call  this  definition  the  segment  definition.  It  has 
been  used  by  several  authors,  including  Asano  et  al.  [AAG*86]  and  Chazelle  and 
Guibas  [CG85].  A  competing  definition  has  been  used  by  El  Gindy  and  Avis  [EA81] 
and  Guibas  et  al.  [GHL*86],  among  others.  This  definition,  which  we  call  the 
shortest  path  definition,  says  that  two  points  axe  mutually  visible  if  the  segment 
connecting  them  does  not  intersect  the  interior  of  any  obstacle. 

Both  definitions  have  advantages  and  disadvantages.  The  chief  advantage  of  the 
shortest  path  definition  is  its  close  relationship  to  shortest  paths:  two  points  are 
mutually  visible  if  and  only  if  the  obstacle-avoiding  shortest  path  between  them 
is  a  straight  segment.  To  understand  the  advantages  of  the  segment  definition, 
let  us  consider  visibility  from  a  point  v.  Let  visible{v,  6)  be  the  set  of  obstacle 
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points  visible  from  v  in  direction  $.  Under  the  segment  definition,  vi3ible{v,6)  is 
a  function:  there  is  at  most  one  obstacle  point  visible  in  any  direction.  Under  the 
shortest  path  definition,  visible{v,d)  can  include  arbitrarily  many  obstacle  points; 
in  fact,  if  the  obstacles  have  degeneracies,  visible{v,  9)  can  include  arbitrarily  many 
obstacle  vertices.  The  set  of  all  points  visible  from  u  is  a  connected  region  of  the 
plane  under  both  definitions,  but  under  the  shortest  path  definition  it  can  have 
protruding  rays  and  segments,  as  in  Figure  1.1(a).  A  point  on  one  of  these  rays 
has  no  visible  points  in  its  neighborhood  except  those  on  the  ray.  By  contrast,  the 
region  visible  from  v  imder  the  segment  definition,  shown  in  Figure  1.1(b),  has  no 
such  H angling  segments  or  rays;  however,  it  is  not  necesseirily  a  closed  region,  as  is 
the  region  visible  from  v  under  the  shortest  path  definition. 

Many  authors  minimize  the  differences  between  the  two  definitions  by  assuming 
that  no  three  obstacle  vertices  are  collinear  (counting  v  among  them).  One  of 
the  goals  of  this  thesis  is  to  cope  gracefully  with  degeneracies,  so  the  algorithms 
presented  here  use  the  segment  definition  of  visibility,  which  has  fewer  inherent 
liabilities  in  the  presence  of  collinear  points.  When  the  algorithms  need  to  compute 
the  region  visible  from  a  point  (which  may  not  be  closed),  they  calculate  instead 
its  boundary,  which  is  always  closed. 

The  algorithms  presented  in  this  thesis  depend  on  the  close  relationship  between 
shortest  paths  and  visibility.  If  two  points  are  mutually  visible,  the  visibility  segment 
connecting  them  is  also  the  shortest  obstacle-avoiding  path  between  them.  On  the 
other  hand,  a  shortest  path  for  a  point  moving  among  polygonal  obstacles  is  a 
polygonal  path;  each  consecutive  pair  of  vertices  on  the  path  is  either  mutually 
visible  or  connected  by  a  polygon  edge.  The  algorithms  that  follow  exploit  this 
connection  in  both  directions:  Chapters  2  and  3  use  visibility  information  to  find 
shortest  paths,  while  Chapters  4  and  5  construct  shortest  paths,  then  use  them  to 
solve  visibility  problems. 

The  thesis  is  divided  into  two  main  sections.  The  first  part  addresses  prob¬ 
lems  involving  arbitrarily  many  polygonal  obstacles,  while  the  second  part  obtains 
improved  bounds  for  the  special  case  in  which  there  is  only  one  polygon. 

Chapters  2  and  3  comprise  the  first  part  of  the  thesis.  Chapter  2  solves  the 
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Figure  1.1.  Two  definitions  of  visibility:  (a)  the  shortest  path  definition, 
and  (b)  the  segment  definition. 
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visibility  gr&ph  and  visibility  polygon  problems  for  a  set  of  n  disjoint  line  segments 
(a  slight  generalization  of  disjoint  polygons).  The  visibility  polygon  problem  asks 
for  the  boundary  of  the  region  of  the  plane  visible  from  a  query  point  v.  That 
is,  the  desired  solution  is  the  function  described  above.  The  chapter 

gives  an  algorithm  to  find  the  visibility  polygon  of  an  arbitrary  query  point  in  0{n) 
time  after  O(n^)  preprocessing.  Clearly,  the  algorithm  is  most  useful  when  many 
visibility  polygons  must  be  computed  for  the  same  set  of  obstacles.  The  visibility 
graph  problem  is  an  ideal  application.  The  visibility  graph  is  a  combinatorial  graph 
structure  with  segment  endpoints  as  nodes  and  an  edge  between  every  pair  of  mu> 
tually  visible  vertices.  It  can  be  built  by  finding  the  visibility  polygon  from  every 
vertex.  Since  the  visibility  graph  can  have  @(n^)  edges,  this  construction  algorithm 
is  worst-case  optimal  if  all  edges  must  be  listed. 

As  noted  above,  shortest  paths  between  segment  endpoints  follow  visibility  graph 
edges  and  polygon  edges.  It  is  possible  to  find  the  shortest  path  between  two  vertices 
in  O(n^)  time  by  building  the  visibility  graph,  augmenting  it  with  polygon  edges, 
and  then  running  Dijkstra’s  shortest  path  algorithm  for  graphs  on  it  [AHU74,  pages 
207-209].  Dijkstra's  algorithm  runs  in  0(n?)  time  on  this  graph  when  implemented 
using  the  Fibonacci  heaps  of  Fredman  and  Tarjan  [FT84].  (This  bound  assumes 
that  the  Euclidean  distance  between  two  points  can  be  computed  in  constant  time, 
and  also  that  stuns  of  such  distances  can  be  compared  in  constzmt  time.)  Chap¬ 
ter  3  extends  this  approach  to  find  shortest  paths  for  a  non-rotating  convex  object. 
The  extension  uses  the  ‘‘configuration  space**  technique  of  Lozano-Perez  and  Wes¬ 
ley  [LW79]  to  transform  the  problem  of  moving  a  convex  object  among  obstacles 
into  that  of  moving  the  object*s  center  among  “fattened**  versions  of  the  obstacles. 
An  analogue  to  the  visibility  graph,  called  the  path  graph,  records  all  the  sub¬ 
paths  that  belong  to  shortest  paths  among  the  fattened  obstacles.  The  algorithm 
of  Chapter  3  constructs  this  graph  &om  the  visibility  graph  of  the  polygons,  then 
uses  Dijkstra*s  algorithm  to  find  the  desired  shortest  path  in  O(n^)  time. 

The  algorithms  of  Chapters  2  and  3  require  fl(n^)  time  because  they  do  not 
take  into  accotmt  the  structure  of  the  obstacles.  Chapters  4  through  6  improve  on 
these  bounds  when  the  only  obstacle  is  a  single  simple  polygon  P  with  n  vertices. 
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In  this  setting  shortest  paths  and  visibility  segments  lie  inside  the  polygon.  All 
of  the  algorithms  presented  here  are  triangidation-based:  given  a  triangulation  of 
the  polygon  (a  partitioning  of  its  interior  into  n  —  2  disjoint  triangles  by  n  —  3  non¬ 
intersecting  diagonals),  the  algorithms  nm  in  linear  time,  which  is  optimal.  The  best 
triangulation  algorithm  known  is  that  of  Taxjan  and  Van  Wyk  [TV86],  which  runs 
in  0(n  log  log  n)  time.  (Triangulation-based  linear  algorithms  like  those  presented 
here  should,  of  course,  be  contrasted  with  linear-time  algorithms  on  “raw”  simple 
polygons,  such  as  calculation  of  the  convex  hull  of  such  a  polygon  P  [GY83,MA79] 
and  calculation  of  the  subpolygon  of  P  visible  from  a  given  point  [EA81,Lee83].) 

Chapter  4  presents  an  algorithm  that,  given  a  triangulation  of  a  simple  poly¬ 
gon  P,  computes  all  shortest  paths  from  a  sotirce  inside  P  to  the  vertices  of  the 
polygon.  This  structure,  called  the  shortest  path  tree,  is  then  used  to  solve  a  va¬ 
riety  of  shortest  path  and  visibility  problems.  These  problems  include  calculation 
of  the  subpolygon  of  P  visible  from  a  given  segment  within  P,  preprocessing  P 
for  fast  “ray  shooting”  queries,  and  several  related  problems.  Chapter  5  uses  the 
shortest  path  tree  plus  some  euiditional  techniques  to  btiild  the  visibility  graph  of  P 
in  0(m  -1-  n  log  log  n)  time,  where  m  is  the  number  of  edges  in  the  visibility  graph. 
This  improves  substantially  on  the  O(n^)  algorithm  of  Chapter  2,  since  m  can  be 
as  small  as  0(n). 

The  shortest  path  tree  of  Chapter  4  can  be  enhanced  to  answer  shortest  path 
queries.  After  a  linear  amotmt  of  additional  processing,  it  can  be  used  to  compute 
in  O(log  n)  time  the  distance  from  the  source  to  an  arbitrary  query  point  inside  P. 
The  path  itself  can  be  extracted  in  time  proportionad  to  the  number  of  turns  along 
it.  A  limitation  of  this  approach  is  that  the  source  is  fixed;  if  the  source  is  also 
arbitrary,  queries  may  take  linear  time  to  answer.  Chapter  6  corrects  the  flaw, 
introducing  data  structures  that  allow  arbitrary  shortest  path  queries.  Once  the 
structures  are  built  (in  linear  time  and  speure),  the  length  of  the  shortest  path 
between  two  arbitrary  points  can  be  computed  in  0(log  n)  time,  and  the  path  itself 
can  be  produced  in  time  proportional  to  the  number  of  turns  along  it.  The  data 
structure  has  several  further  applications:  for  example,  it  can  be  used  to  solve  the 
all-farthest  neighbors  problem  inside  P  in  0(n  log  n)  time,  and  to  build  the  relative 
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convex  htiU  of  m  points  inside  P  in  0(m  log  m  +  m  log  n)  time. 


Chapter  2 

Visibility  Polygons  and  the 
Visibility  Graph  of  Segments 


I  can  see  clearly  now;  the  rain  is  gone. 
I  can  see  all  obstacles  in  my  way. 

—  Johnny  Nash,  “I  Can  See  Cleeurly  Now”  (1972) 


The  visibility-polygon  problem,  or  the  hidden-line  elimination  problem  in  the 
plane,  is  of  fundamental  importance  in  computer  graphics.  It  is  also  important  in 
computational  geometry,  since  it  is  often  used  as  a  subproblem  in  other  geometric 
problems,  such  as  the  shortest-path  problem  in  the  plane  with  polygonal  obstacles. 
The  problem  is  formally  stated  as  follows:  Given  a  set  of  h  disjoint  polygons  with  n 
edges  and  a  query  point  q,  report  in  polar  order  all  the  boundary  points  of  those 
polygons  that  are  visible  from  q.  For  the  case  of  a  single  simple  polygon,  El  Gindy 
and  Avis  [EA81]  and  Lee  [Lee83]  presented  0(n)  time  algorithms.  Asano  [Asa84] 
gave  an  0(n  +  h  log  h)  time  algorithm  for  the  case  where  the  h  disjoint  polygons 
are  convex,  and  an  0(n  log  h)  time  algorithm  for  the  general  problem. 
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In  this  chapter  we  consider  a  canonical  case  of  the  visibility  polygon  problem, 
in  which  the  obstacles  are  strmght-line  segments:  Given  a  set  of  n  segments  with 
arbitrary  slopes,  non-intersecting  except  at  their  endpoints,  and  an  arbitrary  query 
point  q,  find  the  parts  of  those  segments  that  are  visible  &om  q,  where  a  point  p 
is  visible  from  q  if  the  open  line  segment  intersects  no  segment  in  the  set  (Fig¬ 
ure  2.1).  We  use  VM(5,q)  to  refer  to  the  set  of  all  points  visible  from  q  in  the 
presence  of  S,  and  VP{S,q),  or  just  VP(q),  to  denote  the  set  of  points  of  S  visible 
from  q.  The  set  VP(q)  lies  on  the  boimdary  of  Vis{S,q).  This  chapter  presents  two 
algorithms  that  compute  VP{q)  in  0(n)  time  with  O(n^)  preprocessing  time  and 
0(n^)  space.  The  algorithms  can  easily  be  modified  to  solve  the  visibility  problem 
for  a  set  of  disjoint  polygons  with  n  edges  in  the  same  complexity  boimds. 


Figure  2.1.  The  visibility  problem  for  a  set  of  segments 

Our  methods  are  especially  efficient  when  the  visibility  problem  is  solved  repeat¬ 
edly  for  the  same  set  of  polygons,  which  is  the  case  in  constructing  the  visibility 
graph.  The  visibility  graph  of  disjoint  polygons  with  n  edges  can  be  found  by  solv¬ 
ing  the  visibility  problem  from  each  vertex  of  those  polygons,  and  hence  it  can  be 
constructed  in  0(n*)  time  and  space.  This  problem  had  been  previously  solved 
in  O(n^logn)  time  by  Lee  [Lee78].  Independently,  Welzl  [Wel85]  has  discovered  a 
different  O(n^)  solution.  Since  the  visibility  graph  can  have  size  n(n^),  om:  methods 
and  Welzl’s  are  optimal  to  within  a  constant  factor,  if  we  sisstune  that  zdl  the  edges 
of  the  graph  must  be  listed. 
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The  visibility  graph  can  be  used  to  find  shortest  paths.  The  graph  formed 
by  adding  all  polygon  edges  to  the  visibility  graph  contains  all  shortest  paths  be¬ 
tween  polygon  vertices.  The  shortest  path  between  a  particuleir  pair  of  vertices 
can  be  found  in  0(rP)  time  by  applying  Dijkstra’s  algorithm  (AHU74,  pages  207- 
209][FT84]  to  this  graph  (Lee  and  Preparata  [LP84],  Sharir  and  Schorr  [SS84]). 
This  improves  the  0(n^  log  n)  botmd  for  the  shortest-path  problem  due  to  Sharir 
and  Schorr  [SS84].  However,  this  is  not  the  best  bound  known  for  the  shortest  path 
problem.  When  the  obstacles  consist  of  k  disjoint  polygons,  the  algorithm  of  Reif 
and  Storer  [RS85]  solves  the  shortest  path  problem  in  0{kn  -H  nlogn)  time. 

The  algorithms  mentioned  in  the  preceding  paragraph  find  shortest  paths  for 
a  point  moving  among  polygonal  obstacles.  Chapter  3  uses  the  visibility  graph 
to  solve  a  more  difficult  problem,  that  of  finding  shortest  paths  for  a  non-rotating 
convex  object. 


2.1  The  visibility  problem  for  a  set  of  segments 

Let  5  be  a  set  of  n  segments  with  arbitrary  slopes  allowed  to  intersect  only  at 
their  endpoints,  and  let  9  be  an  arbitrary  query  point.  We  shall  describe  in  top- 
down  fashion  two  algorithms  for  finding  the  parts  of  the  segments  in  5  that  are 
visible  firom  q.  Let  pi  (*  =  1, . . . ,  iV)  be  all  the  endpoints  of  segments  in  5,  where 
N  <  2n.  For  simplicity,  we  assume  that  no  two  points  in  the  set  {?,pi, . . .  ,PAr}  have 
the  same  x-coordinate  (this  assumption  can  be  removed  by  techniques  of  Chazelle, 
Guibas  and  Lee  [CGL85]).  Our  algorithms  also  work  when  the  segment  endpoints 
P19  •  •  •  iPn  have  distinct  x-coordinates  and  q  €  {pi, . . .  ,pjv}*  The  algorithms  solve 
the  visibility  problem  by  decomposing  it  into  the  following  two  subproblems. 

(1)  Polar  Sorting:  Consider  a  polar  coordinate  system  with  the  point  q  as  the 
origin  and  the  vertical  ray  directed  upward  &om  q  as  the  reference.  Let  us  denote  the 
polar  angle  of  a  point  p  by  ^,(p),  where  the  poleir  angle  increases  counterclockwise 
aroimd  q.  Problem  (1)  is  to  find  a  sorted  list  of  p,-  (t  =  1, . . . ,  iV)  in  increasing  order 
of  9,(pi). 
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We  define  the  rank  «(p»)  of  point  p,-  (i  =  1, . . . ,  iV)  to  be  its  rank  in  the  sorted 
list  obtained  in  Problem  (1).  Points  with  the  same  9,  are  given  the  same  rank. 
The  maximum  rank  is  JV,  <  N.  For  0  <  i  <  JV,  +  1,  we  use  a(t)  for  the  angle 
corresponding  to  rank  i,  where  a(0)  =  0  and  a(JV’*  + 1)  =  2ir. 

Before  we  introduce  the  second  subproblem,  let  us  perform  certain  normaliza¬ 
tions  on  the  set  5.  We  first  remove  the  segments  whose  endpoints  and  q  are  coUinear, 
and  then  break  into  two  separate  parts  each  segment  in  5  that  intersects  the  vertical 
ray  f  emanating  upward  firom  q.  The  newly  created  endpoints  all  have  the  same 
x-coordinate,  but  this  causes  no  trouble;  our  algorithms  require  only  the  original 
endpoints  to  have  distinct  x-coordinates.  We  denote  the  resulting  set  of  segments 
by  Sgt  and  let  n,  =  |5,|.  For  two  segments  s  and  s'  in  we  say  that  s  is  in 
front  of  s',  written  s  -<,  s',  if  there  exists  a  non-vertical  ray  emanating  firom  q 
that  intersects  both  segments  and  hits  s  before  s'.  Chazelle  [Cha83]  shows  that  the 
relation  -<,  can  be  embedded  in  a  total  order;  see  Figure  2.2(a).  One  of  our  two 
algorithms  computes  such  a  total  order  of  the  segments  in  Sg  compatible  with  -<g. 
For  each  segment  s  in  Sg,  we  define  j/(s)  to  be  the  rank  (from  1  to  n,)  of  s  in  this 
total  order. 


Figure  2.2.  Solving  the  visibility  problem.  In  (b),  the  x-coordinates 
correspond  to  the  polar  order  of  the  segment  endpoints,  and  the 
y-coordinates  correspond  to  a  total  ordering  of  the  segments  based  on 
the  “in  front  of”  relation. 
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We  define  the  interval  Ix{s)  of  each  segment  in  5,  as  follows:  Suppose  the 
endpoints  of  s  in  5  are  u  and  v  with  x(u)  <  x(v).  If  s  does  not  intersect  r,  then  s  is 
in  Sq  £ind  we  let  /«(s)  be  [x(u),x(t;)].  If  on  the  other  hand  s  intersects  r  at  w,  then 
it  gives  rise  to  two  segments  uW  and  WW  in  Sqy  and  we  set  Ix(uvJ)  to  be  [0,  x(u)]  and 
Ix{Wv)  to  be  [x(w),iVj.  +  1]. 

The  visibility  polygon  of  q  is  defined  by  considering  rays  that  emanate  from  q 
and  proceed  vmtil  one  of  the  given  line  segments  is  encountered.  Notice  that  all  the 
rays  in  the  angular  range  {ot{k),  a{k  + 1)),  0  <  k  <  Ng,  have  their  first  intersection 
with  the  same  segment.  We  allow,  of  course,  for  the  possibility  that  a  ray  may 
encotmter  no  segment  at  all  in  some  angular  intervals. 

(2)  Visibility  of  Sorted  Segments:  For  each  1;  =  0, 1, . . . ,  iVs,  we  define  visible{k) 
to  be  the  segment  in  Sq  intersected  by  all  rays  from  q  in  the  angular  interval 
(a{k),  a(i(;+l)).  If  no  such  segment  exists,  we  set  visible{k)  to  be  0.  Problem  (2)  is  to 
find  a  polar  view  of  the  intervals,  that  is,  to  find  visible(k)  for  each  k  =  0,l,...,Nx 
(see  Figmre  2.2(b)). 

This  definition  of  viaiblt  is  a  variant  of  the  definition  iised  in  the  introduction. 
This  is  a  discrete  version  of  that  continuous  definition. 

Even  though  Sq  does  not  include  segments  lying  on  lines  through  g,  such  seg¬ 
ments  may  occur  in  5.  The  visible  portion  of  a  segment  lying  on  a  ray  from  q  is 
at  most  its  endpoint  p  nearer  q.  Visibility  can  be  checked  by  determining  which  is 
the  nearest  to  q,  on  the  ray  emanating  from  q  towards  p,  among  p  and  segments 
visible(k  —  1)  and  visible{k),  where  k  =  x(p).  If  q  lies  on  a  segment  of  5,  nothing  is 
visible  from  q  in  the  directions  of  the  segment  endpoints.  Thus,  if  we  have  already 
solved  problems  (1)  and  (2),  we  can  find  in  polar  order  the  parts  of  the  segments 
of  S  that  are  visible  from  q  in  0(n)  time. 

In  the  above  problems,  as  depicted  in  Figure  2.2,  we  cut  the  plane  along  the  ray 
emanating  upward  from  q  and  spread  it  out  according  to  the  angular  and  radial 
orders  x(pi)  and  y(s).  (Note  that  although  it  is  convenient  to  think  of  embedding 
the  “in  front  of”  relation  X,  in  a  total  order,  it  is  not  necessary  to  do  so.)  In  this 
spread-out  view,  the  lowest  segment  in  interval  [fc,  fc  -H 1]  is  visible{k)',  Problem  (2)  is 
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to  find  the  lowest  segment  in  all  intervals,  the  lower  envelope  of  the  set  of  segments. 
It  will  be  convenient  in  our  second  algorithm  to  think  of  Problem  (2)  in  terms  of 
the  spread-out  view. 

In  the  next  section  we  establish  the  following  lemma: 

Lemma  2.1.  Using  O(n^)  preprocessing  time  and  O(n^)  space  for  constructing  an 
arrangement  of  lines,  we  can  £nd  the  sorted  list  of  pi  (i  —  ,N)  in  increasing 

order  of  Bq{pi)  in  0{n)  time  for  each  query  point  q. 

In  Sections  2.3  and  2.4  we  present  two  different  linear  solutions  to  Problem  (2). 
The  first  approach  uses  a  triangulation  to  construct  the  total  order  y(s)  explicitly  in 
linear  time,  then  applies  the  linear  set-\mion  algorithm  of  Gabow  and  Tarjaa  [GT83] 
to  find  visible(k)  for  0  <k  <  Ng  in  0(n)  time.  The  second  method  uses  a  scanline 
to  sweep  through  the  segment  endpoints  in  polar  order.  By  breaking  each  segment 
in  5,  into  two  pieces,  it  produces  regularized  segments  for  which  visibilities  can  be 
fotmd  in  0(n)  time  using  bit  operations  or  table  lookup. 

From  the  lemma  and  the  preceding  discussion,  we  obtain  the  following  theorem: 

Theorem  2.1.  For  a  set  of  n  arbitrarily  oriented  segments  and  a  query  point  q, 
the  parts  of  the  segments  that  are  visible  from  q  can  be  found  in  0(n)  time  with 
O(n^)  preprocessing  time  and  0{v?)  space. 


2.2  Polar  sorting  and  duality 

For  the  set  of  points  p,-  (*  =  1, . . . ,  iV),  consider  the  following  geometric  transforma¬ 
tion  T,  as  used  by  Brown  [Bro80]:  Let  /  be  a  non- vertical  line  in  the  plane  whose 
points  (x,  y)  satisfy  y  =  a/x  +  bi,  and  let  p  =  (xp,  yp)  be  a  point  in  the  pleme.  Trans¬ 
formation  T  maps  /  into  the  point  pr(0  =  P  hip)  whose 

points  (a,  b)  satisfy  b  —  — XpO  +  yp.  This  transformation  is  an  exzunple  of  a  duality 
mapping. 

Geometric  duality  in  two  dimensions  maps  lines  to  points  and  points  to  lines 
while  preserving  their  relationships.  For  example,  if  two  lines  map  to  a  pair  of 
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points  in  the  dual  space,  their  intersection  point  maps  to  the  line  connecting  the 
dual  points.  Duality  is  a  powerful  tool;  for  examples  of  its  use  in  computational 
geometry,  see  [CGL85,CG85,GRS83,Sto87].  We  use  duality  again  in  Section  4.3. 

Using  the  transformation  T,  we  can  solve  Problem  (1)  as  follows:  In  the  prepro¬ 
cessing  step,  we  map  the  points  p,-  (i  =  1, . . . ,  JV)  into  lines  by  T.  These  lines  and 
their  intersections  induce  a  subdivision  of  the  mapped  plane,  which  we  construct 
in  O(n^)  time  and  space  using  the  algorithm  of  Chazelle,  Guibas  and  Lee  [CGL85] 
or  Edelsbruimer,  O’Rourke  and  Seidel  [EOS83].  Such  a  subdivision  is  called  the 
arrangement  of  the  lines  [Gru72].  Given  a  query  point  q,  we  map  it  by  T  into 
the  line  /r(9)  a^d  insert  it  into  the  arrangement,  which  can  be  done  in  0(n) 
time  [CGL85,EOS83].  The  line  coimecting  points  q  and  p,-  in  the  original  plane  cor¬ 
responds  to  the  point  of  intersection  of  the  two  lines  /t(9)  and  /t(p<)  in  the  mapped 
plane.  (Because  no  two  points  in  {9,pi, . . .  ,P7v}  have  the  same  x-coordinate,  there 
are  no  parallel  lines  in  the  arrangement.)  The  ordering  by  slope  of  the  lines  con¬ 
necting  q  and  p,-  (i  =  1, . . . ,  N)  corresponds  to  the  ordering  by  x-coordinate  of  the 
points  of  intersection  of  the  line  Iriq)  with  lines  Iripi)  (*  =  1, . . . ,  iV)  in  the  mapped 
plane.  Since  we  have  the  arrangement  at  hand,  this  ordering  can  be  obtuned  in 
0(n)  time. 

For  a  set  of  points  p,-  with  0  <  ^,(p,)  <  ir  (resp.  tt  <  0q{pi)  <  27r),  the  ordering 
of  these  points  in  increasing  order  of  9q{pi)  is  precisely  the  ordering  of  them  in 
increasing  order  of  the  slopes  of  the  lines  joining  q  and  p,-.  Thus  the  ordering  of  the 
points  Pi  in  increasing  order  of  9q  can  be  obtained  from  their  ordering  with  respect 
to  slope  in  0(n)  time.  This  proves  Lemma  2.1. 


2.3  Finding  the  visibility  polygon:  triangulation 
and  set-union 

In  our  first  solution  to  Problem  (2),  we  produce  a  total  order  p(s)  in  which  the 
“in  front  of”  relation  may  be  embedded.  To  do  this,  we  triangulate  the  set  S 
of  segments  in  the  preprocessing  step;  specifically,  we  triangulate  the  convex  hull 
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of  S  so  that  all  the  segments  in  5  are  edges  of  the  triangulation.  This  can  be 
done  in  0(n  log  n)  time  and  0(n)  space  (El  Gindy  and  Toussaint  [ET84],  Gaxey  et 
al.  [GJPT78]).  We  denote  by  T  the  set  of  edges  of  this  triangulation,  where  we  have 
SCT. 

For  a  given  q,  we  modify  T  to  produce  a  new  triangulation.  To  do  this,  we 
first  stretch  a  line  segment  up  from  q  to  the  boundary  of  the  convex  hull.  When 
we  add  this  segment,  some  faces  become  non-triangles.  Adding  new  segments  to 
such  faces,  we  can  obtain  in  0(n)  time  a  new  triangulation  whose  edge  set,  denoted 
by  Tg,  contains  5,.  Then  we  construct  a  directed  graph  G(T,)  whose  vertex  set 
is  Tg  and  whose  arcs  are  obtained  as  follows:  For  each  triangle  composed  of  edges 
ei,  62,  C3  e  T,  as  depicted  in  Figture  2,3, 

•  if  point  q  is  in  the  interior  of  region  Ri,  then  add  arcs  from  62  to  Ci  and  from  €3 
to  Cl  (the  cases  where  q  is  in  the  interior  of  region  R2  or  R3  are  similar); 

•  if  point  q  is  in  the  interior  of  iZi,  then  add  arcs  from  Cj  to  62  and  from  ei  to  63 
(the  cases  where  q  is  in  the  interior  of  region  ^2  or  .R3  are  similar); 

•  if  point  q  is  on  the  half  line  /12,  but  not  a  vertex  of  the  triangle,  add  an  arc 
from  C3  to  C2  (the  cases  where  q  is  on  I21,  hi,  hz,  hs  or  /32  are  similar); 

•  if  point  q  is  on  some  edge  or  in  the  interior  of  the  triangle,  do  nothing. 

For  this  graph  G(T,),  we  have  the  following  lemma. 

Lemma  2.2.  The  directed  graph  G{Tg)  is  acyclic.  Furthermore,  for  s,  s'  €  5,  such 
that  s  -<g  s',  there  is  a  directed  path  from  3'  to  s  in  G(Tg). 

Proof:  Let  us  consider  extending  the  relation  X,  to  all  edges  of  T,  to 
get  X' .  The  relation  can  still  be  embedded  in  a  total  order  [Chzi83],  and 
since  the  arcs  of  G(T,)  record  a  subset  of  X^,  G(Tg)  must  be  acyclic. 

Any  two  segments  s,s'  €  Sg  with  s  X,  s'  intersect  a  common  ray 
from  q.  The  x'  relations  between  edges  of  T,  that  cross  this  ray  at 
adjacent  intersections  are  all  recorded  in  G{Tg),  and  hence  there  is  a 
directed  path  from  s'  to  s  in  G(Tg).  | 
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Figure  2.3.  Constructing  the  directed  graph  G{Tg) 

A  topological  order  of  G(T,)  can  be  found  in  0(n)  time  (note  that  the  number 
of  vertices  and  axes  of  G{Tg)  is  0(n)).  From  the  topological  order,  which  is  a  total 
order  on  T,,  we  can  obtain  in  0(n)  time  a  total  order  on  5,  in  which  the  relation 
is  embedded. 

In  order  to  obtain  a  linear-time  algorithm  for  Problem  (2),  we  employ  the  set- 
union  algorithm  of  Gabow  and  Tarjan  [GT83],  which  solves  the  following  problem: 
Initially,  there  are  AT*  +  2  disjoint  sets  {0},  {1},  {2}, . . . ,  {iV,},  {Ng  -I- 1};  two  op¬ 
erations  may  be  performed  on  this  system  of  disjoint  sets,  find{k)  and  link(k)  for 
0  <  k  <  Ng..  The  function  find{k)  returns  the  maximum  element  of  the  set  contain¬ 
ing  k,  and  link{k)  unites  the  set  containing  k  with  the  set  containing  fc-|- 1.  Gabow 
and  Tzurjan’s  algorithm  executes,  on-line,  a  sequence  of  0(iV*  -|-  rig)  find  and  link 
operations  in  0{Nx  +  ng)  time  [GT83].  Using  these  operations.  Problem  (2)  can  be 
solved  by  the  algorithm  of  Figure  2.4. 

The  algorithm  keeps  track  of  contiguous  blocks  of  intervals  for  which  visible(k) 
is  known.  It  maintains  the  following  invariant:  two  elements  k  and  k'  {k  <  k^) 
belong  to  the  same  set  if  and  only  if  visible{j)  has  been  found  for  every  k  <  j  <  k\ 
Thus,  for  any  k  in  the  range  0  <  fc  <  Nx,  if  find{k)  ^  iV,  -H  1,  then  visible{find{k)) 
is  unknown.  This  means  that  the  algorithm  sets  visible^k)  correctly  and  at  most 
once  for  each  0  <  ik  <  iV*.  Hence  find  ^lnd  link  operations  are  executed  0(n)  times 
and  the  algorithm  runs  in  linear  time. 
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for  j  :=  1  to  n,  do 
begin 

Let  s  be  the  segment  in  5,  with  y(s)  =  j; 

Let  [/,  r]  be  the  interval  Ix{s)  of  s; 

I  :=  find{l); 

while  /  <  r  do 
begin 

visible{l)  :=  s;  link(l)]  I  :=  find{l) 

end 

end 

Figure  2.4.  A  set-union  solution  to  the  visibility  problem 

2.4  Finding  the  visibility  polygon:  scanlines  and 
segment  splitting 

The  second  algorithm  for  Problem  (2)  does  not  compute  the  y(s)  order  of  the 
segments;  it  does  not  require  either  a  triangulation  or  the  set-union  data  structure. 
Instead,  it  uses  a  polar  scanline  to  sweep  through  the  sorted  list  of  segments  and 
find  the  visible  sub-segments.  It  will  be  convenient  in  this  algorithm  to  think  of  the 
segments  as  being  spread  out  as  in  Figure  2.2(b);  Problem  (2)  in  this  context  is  the 
problem  of  finding  the  lower  envelope,  and  the  scanline  is  vertical.  We  will  describe 
the  algorithm  in  two  passes:  first  we  give  an  easily-explained  but  slower  method, 
and  then  we  describe  the  modifications  needed  to  make  it  linear. 

If  a  segment  3  has  endpoints  p,-  and  pj  with  x(pi)  <  x(pj),  let  us  define  its  left 
and  right  endpoint  rzinks  /(s)  =  x(pi)  and  r(s)  =  x(pj).  In  what  follows  we  will 
often  identify  an  endpoint  p  with  its  rank  x{p). 

The  solution  to  Problem  (1)  gives  us  the  segments’  endpoints  in  sorted  or¬ 
der;  running  through  the  list  from  left  to  right  corresponds  to  sweeping  a  scanline 
through  the  segments.  If  the  list  gives  each  segment’s  length  along  with  its  left 
endpoint,  an  algorithm  has  enough  information  to  compute  the  lower  envelope  to 
the  right  of  the  scanline  (the  right  envelope)  for  those  segments  that  start  to  the 
scanline’s  left.  Since  every  segment  that  contributes  to  the  right  envelope  must 
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cross  the  scanline,  this  envelope  looks  like  a  staircase,  monotonically  ascending  to 
the  right.  Every  segment  in  it  is  lowest  just  once,  from  its  right  endpoint  part  way 
back  to  the  scanline.  See  Figure  2.5  for  an  example.  The  right  envelope  is  important 
for  the  following  reason:  when  the  scanline  moves  from  x  =  A:tox  =  fc  +  l  (all 
points  Pi  with  x{pi)  <  k  have  been  processed),  the  lowest  segment  of  the  current 
right  envelope  is  visible(k). 


% 
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Figure  2.5.  A  right  envelope 


This  observation  suggests  a  way  to  build  the  lower  envelope  incrementally:  we 
just  need  to  maintain  the  current  right  envelope  as  the  scanline  sweeps  across  the 
segments.  For  a  given  position  of  the  scanline  x,  let  the  set  of  segments  in  the 
right  envelope  be  ^7  =  {ei,  62, ... ,  Cm}-  This  is  a  subset,  possibly  a  proper  one,  of 
the  segments  whose  intervals  contain  x.  Let  the  right  endpoint  rank  of  a  segment 
Ci  €  E  he  ri  —  r{ti).  Number  the  elements  of  E  so  that  x  =  ro  <  ri  <  r2  <  •  •  •  < 
<  >'m+i  =  00.  The  left  endpoints  of  segments  in  E  are  all  at  or  to  the  left  of  x. 
The  segments  themselves  are  comparable  imder  the  -<,  relation,  and  e,-  c,+i  for 
1  <  i  <  m. 

Let  3  be  a  segment  whose  left  endpoint  is  encountered  by  the  scanline  at  x.  The 
right  endpoint  r(3)  falls  somewhere  in  the  staircase  pattern,  that  is,  r,-  <  r(3)  <  r,+i 
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for  some  0  <  i  <  m.  Once  the  position  of  r(3)  in  the  endpoint  list  is  known,  the 
lower  envelope  can  be  updated  in  consteint  amortized  time  per  segment.  Let  i  be 
the  index  such  that  r,-  <  r(3)  <  r^+i.  If  e,+i  3,  then  3  does  not  contribute  to 
the  lower  envelope  and  may  be  dropped.  On  the  other  hand,  if  3  -<,  ej  for  some 
j  <  i,  then  segment  ej  should  be  dropped  from  the  current  right  lower  envelope.  If 
r(S)  =  r,+i  and  3  c,+i,  then  the  segment  Cj+i  should  also  be  removed  from  E. 

When  the  scanline  encounters  a  right  endpoint,  the  corresponding  segment  may 
or  may  not  appear  in  the  current  right  envelope.  If  the  segment  is  not  in  the  right 
envelope,  then  no  action  is  needed.  If  the  segment  does  appear  in  the  right  envelope, 
then  it  is  ei,  the  lowest  segment  in  it.  (It  could  not  be  Ci  for  t  >  1,  since  that  implies 
ri  >  r,-,  a  contradiction.)  In  this  case  cj  must  be  removed  from  E  and  the  segments 
below  it  advanced.  Segment  62  becomes  Ci,  eind  so  on. 

The  implementation  of  the  algorithm  in  Figure  2.6  keeps  E  as  &  list  ordered 
by  r(s),  modifying  it  as  the  scanline  sweeps  across  endpoints  from  left  to  right.  In 
the  code,  X  is  a  sorted  list  of  right  and  left  endpoints  (two  per  segment  in  5,), 
each  associated  with  its  segment.  The  program  manipulates  the  list  E  using  stan¬ 
dard  list  operators  Delete{s),  Insert{s,t),  PTed{3),  and  Head{E),  which  respec¬ 
tively  delete  s  from  Ey  insert  s  after  t  in  Ey  return  the  predecessor  of  s  in  Ey 
and  return  the  element  s  in  E  with  minimum  r(s).  To  insure  that  the  Insert 
operator  can  always  be  applied,  the  list  E  is  preceded  by  a  dummy  segment  cq. 
The  program  also  uses  a  nonstandard  function  Shortest AsLong As  (s)y  which  re- 
ttums  the  segment  s*  in  Ey  if  any,  ending  immediately  to  the  right  of  s.  (Formally, 
r{s*)  =  min{r(s')  |  a'  G  jE7,  r(s')  >  r(s)}.)  Note  that  in  the  algorithm  of  Figure  2.6, 
E  never  contains  two  segments  3  and  t  with  r(a)  =  r{t).  This  fact  is  important  in 
the  implementation  of  Insert  given  in  Figure  2.7. 

Except  for  the  problem  of  locating  r(3)  among  the  endpoints  of  Ey  the  list 
operations  in  the  program  of  Figure  2.6  can  be  performed  in  linear  time  overall 
using  doubly-linked  lists.  Unfortunately,  the  one-dimensional  point-location  prob¬ 
lem  solved  by  the  function  Shortest AsLongAs  {3)  requires  fi(logn)  average  time  per 
query  in  the  general  case,  as  well  as  more  complicated  data  structures,  and  hence 
the  algorithm  presented  so  fax  runs  no  faster  than  Ct{n  log  n)  in  the  worst  case. 
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lastJe  0; 

Initialize  E,  the  current  right  envelope,  to  be  empty  except  for  the 
dummy  element  co; 
while  X  is  non-empty  do 
begin 

Remove  the  next  endpoint  p  from  X\  curJt  :=  x(p); 
for  k  :=  lasiJe  to  curJc  —  1  do 
visible{k)  :=  Head(E)] 
lasiJt  :=  curJc] 

Let  3  be  the  segment  associated  with  p; 
if  p  is  the  left  endpoint  of  a  then 
begin 

t  :=  ShortestA3LongA3{s); 

/*  Locates  r(s)  in  (r©, ri, . . . , r,n+i).  */ 
if  3  then 

/*  3  is  not  obscured  by  t.  */ 
begin 

p  :=  Pred{t); 

if  r(s)  =  r{t)  then  Deleie(t); 

Inaeri^a^p)] 

/*  Insert  s  after  p  in  E.  Note  that  E  never  contains  two 
segments  with  equal  right  endpoints.  */ 
while  p  ^  0  and  s  -<,  p  do 
begin 

p'  :=  Pred{p)\  Delett{p)\  p  :=  p'; 
end 

end 

end 

else  /*  This  is  a  right  endpoint.  */ 
if  s  =  Head{E)  then  Delete{Sead(E)y, 

/*  else  already  deleted.  */ 

end 

Figure  2.6.  A  scanline  solution  to  the  visibility  problem 
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Modification  of  the  segments  can  be  used  to  construct  a  linear  algorithm.  As  a 
first  step,  note  that  two  lower  envelopes  (essentially  segment  lists  with  at  most  one 
segment  in  any  interval  [fc,  k  +  1])  can  be  merged  into  their  common  lower  envelope 
in  linear  time.  In  particular,  if  an  algorithm  splits  each  segment  into  two  parts  and 
independently  computes  the  lower  envelopes  of  the  left  parts  and  the  right  parts,  the 
resulting  pair  of  lists  can  be  merged  in  linear  time.  This  observation  will  be  useful 
if  we  can  find  a  way  of  splitting  the  original  segments  that  makes  point  location 
easy  in  the  resulting  subproblems.  To  this  end  we  define  the  following  function  of 
two  integers. 

The  least  common  ancestor  lea  (a,  b)  of  a  pair  of  positive  integers  a  <  b  ia  the 
integer  c,  a  <  c  <  b,  with  a  maximal  number  of  trailing  zeroes  when  written  in 
binary.  We  can  write  a  and  b  in  binary  as 

a  =  aO/S 
b  =  O'  1 7, 

where  the  lengths  of  /?  eind  7  are  both  equal  to  some  integer  k.  Then  lca(a,  b)  = 
a  1 0*  unless  /?  =  0*,  in  which  case  lca{a,  b)  =  a.  For  example,  /ca(5, 15)  =  8  = 
(1000)3  and  /ca(14, 15)  =  14.  The  lemma  below  shows  that  lca{a,b)  is  unique.  A 
trivial  but  useful  fsu:t  is  that  lca(a,  b)  =  lca{a,  lca{a,  b)).  The  lea  function  can  be 
computed  in  a  constant  number  of  operations  xmder  the  RAM  model,  as  shown  by 
Harel  [HarSO]. 

Lemma  2.3.  If  lca{a,  6)  =  c  =  d  2*  for  some  odd  integer  d,  then  no  d  c  in  the 
interval  [a,  b]  is  divisible  by  2*. 

Proof:  Suppose  to  the  contrary  that  there  is  some  d  ^  c,  a  <  d  <  b, 
such  that  d  =  d'2*  for  some  odd  integer  d!  ^  d.  Then  there  is  an  even 
integer  d  strictly  between  d  amd  d',  d  =  d2^  for  7'  >  0  and  d  odd.  But 
then  c  =  d  2‘  =  d  2*'^^  is  strictly  between  a  and  6  and  has  more  trailing 
zeroes  than  c,  contradicting  the  fact  that  c  =  lca{a,  b).  | 

Let  us  consider  splitting  segments  at  the  point  indexed  by  the  least  common 
ancestor  of  their  endpoint  ranks.  That  is,  we  split  a  segment  s  with  Ix{s)  =  [/,  r]  into 
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segments  si  and  S2  with  Ix{si)  =  [l,lca(l,r)]  and  Ixis2)  =  [/co(/,r),r].  Note  that 
lca(l,r)  may  be  equal  to  I  or  r,  and  thus  the  splitting  may  not  divide  the  segment 
at  all.  Let  C  be  the  set  of  segment  left  portions  and  R  be  the  symmetrically  defined 
set  of  right  portions.  Since  finding  the  lower  envelope  of  %  is  the  mirror  image  of 
finding  that  of  £,  we  will  consider  only  the  latter  case.  Let  us  classify  segments 
in  C  by  the  number  of  trailing  zeroes  in  the  binary  representation  of  their  right 
endpoints.  (This  is  just  the  number  of  trailing  zeroes  in  the  least  common  ancestor 
of  the  original  segment  endpoints.)  To  do  this  we  introduce  a  zero-counting  function 
z(i)  whose  value  is  the  integer  j  such  that  i  —  d2^  with  d  odd;  we  use  this  function 
to  classify  each  segment  s  €  £  according  to  the  value  of  z(r(s)).  The  following 
lemma  is  crucial  to  the  linearity  of  our  algorithm: 

Lemma  2.4.  For  any  x  and  j,  let  Uj  be  the  set  of  segments  in  C  that  touch  x 
and  have  z(r{s))  =  j,  that  is,  Uj  —  {s  6  £  |  /(s)  <x  <  r(s)  and  z(r(s))  =j}. 
Then  the  number  of  distinct  right  endpoints  of  segments  in  Uj  is  at  most  one: 
|{i  I  i  =  r(s),  s  €  Uj}\  <  1. 

Proof:  Suppose  that  two  segments  si,S2  €  Uj  have  different  right  end¬ 
points,  r(si)  <  r(s2).  Then  Xr(s2)  includes  at  least  two  integers,  r(si) 
and  r(s2),  that  are  divisible  by  2^.  But  since  r(s2)  =  lca(/(s2)>^(52))> 
this  contradicts  the  uniqueness  of  the  lea  function.  | 

This  lemma  means  that  in  the  scanline  algorithm  given  above,  there  are  only 
0(log  n)  right  endpoints  in  the  current  right  envelope  at  any  given  time.  Further¬ 
more,  the  right  endpoints  of  segments  that  touch  the  scanline  position  x  are  ordered 
in  X  by  the  function  z(r(s)).  That  is,  for  two  segments  si  and  S2  that  include  x, 
r(si)  =  r(s2)  if  and  only  if  z(r(si))  =  z(r(s2)),  and  r(si)  <  r(s2)  if  and  only  if 
^(r(si))  <  2(r(s2)).  This  is  because  no  segment  s  £  C  has  an  interval  that  includes 
an  integer  i  with  z{i)  >  z(r(s)). 

The  algorithm  presented  in  Figure  2.6  keeps  exactly  one  segment  for  each  end¬ 
point  in  the  right  envelope.  Therefore  it  can  use  a  (flogn]  -|-  l)-bit  vector  to  keep 
track  of  the  segments  that  are  present.  Define  the  bit  vector  v  such  that  v[i]  =  1 
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if  and  only  if  a  segment  s  with  z(r(s))  =  i  is  present  in  the  current  right  enve¬ 
lope.  When  the  scanline  encounters  a  new  segment  5  with  2(r(3))  =  i,  the  segment 
with  which  5  must  be  compared  is  the  segment  s*  in  E  terminating  immediately 
to  the  right  of  $.  This  is  the  segment  corresponding  to  the  smzillest  j  >  i  such 
that  v\j]  =  1.  The  index  j  can  be  found  in  constant  time  given  v  and  i  using  bit 
operations  or  lookup  in  a  table  of  size  0{n).  (The  table  space  can  be  reduced  to 
0(n*)  for  any  fixed  e,  but  the  computation  time  increases  to  0(l/e).)  For  use  in  the 
subroutines  of  Figure  2,7,  let  us  define  the  function  /(v,  i)  to  be  the  least  integer 
j  >  i  such  that  v{j]  =  1. 

As  noted  above,  a  doubly-linked  list  of  segments  in  E  is  a  good  structure  for 
all  operations  of  the  algorithm  except  point  location.  To  get  from  the  index  j 
discussed  above  to  the  segment  s*  itself,  the  algorithm  keeps  a  ( [log  n]  +  l)-entry 
table  of  pointers  into  the  linked  list.  The  entry  je^[i]  points  to  the  segment  s  in  E 
with  z(r{s))  =  i,  if  any.  These  pointers  are  maintained  by  the  Insert  and  Delete 
functions  defined  in  Figure  2.7. 


2.5  Extensions 

In  this  section,  we  discuss  several  extensions  of  our  algorithms. 

(1)  The  dynamic  problem 

Either  of  the  algorithms  can  be  easily  extended  to  the  dynamic  problem,  where 
the  set  of  segments  is  updated  by  insertions  and  deletions.  We  can  execute  each 
update  (inserting  or  deleting  a  segment)  in  0(n)  time  without  increasing  the  other 
complexity,  since  the  line  arrangement  can  be  updated  in  linear  time  by  the  algo¬ 
rithms  in  [CGL85]  and  [EOS83],  and  the  triangulation  needed  by  the  first  algorithm 
can  be  updated  in  linear  time  by  the  algorithm  of  El  Gindy  and  Toussaint  [ET84]. 

(2)  Visibility  of  a  set  of  disjoint  polygons 

The  visibility  problem  from  a  point  for  a  set  of  disjoint  polygons  with  n  edges 
can  be  solved  in  the  same  complexity  by  applying  the  above  algorithms  to  the  edges 
of  those  polygons.  In  the  visibility  problem  for  disjoint  polygons,  it  is  considered 
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function  Shortest AsLong As (s) 
begin 
i  :=  z{r{s))\ 
j  :=  f{v,  t); 

/*  This  uses  bit  operations  or  table  lookup.  */ 
return(5e^[;]); 
end 

procedure /Twcr<(s,t) 
begin 

Manipulate  pointers  to  insert  s  after  t  in  E; 

i  :=  z(r(s)); 

t;[i]  :=  1;  seg[i]  :=  s; 

end 

procedure  Deleters) 
begin 

Manipulate  pointers  to  cut  s  out  of  E\ 
i  :=  2(r(s)); 
u[i]  :=  0;  seg[i]  :=  0; 
end 

Figure  2. 7.  Subroutines  used  by  the  scanline  algorithm.  Because  E  never 
contains  two  segments  s  and  t  with  r{s)  =  r(<),  the  Insert  procedure 
never  overwrites  data.  When  Insert(s,t)  is  called,  v[2:(r(s))]  =  0. 
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that  two  points  are  mutually  visible  if  the  open  segment  connecting  them  lies  in  the 
exterior  of  all  the  polygons  (when  a  polygonal  region  with  holes  is  concerned,  two 
points  are  visible  if  their  connecting  segment  lies  in  the  interior  of  the  boundary 
polygon),  so  the  algorithms  must  be  modified  slightly. 

(3)  The  parallel  view 

We  have  mainly  considered  the  view  from  a  point,  that  is,  a  perspective  view. 
However,  there  is  another  interesting  type  of  view,  called  a  parallel  view,  which 
consists  of  the  portions  of  the  objects  that  are  visible  from  a  given  direction.  Fig¬ 
ure  2.8  gives  an  example  of  a  parallel  view.  (Our  algorithms  reduce  the  problem 
of  finding  a  perspective  view  to  that  of  finding  a  parallel  view.)  The  problem  of 
finding  the  parallel  view  from  an  arbitrary  direction  can  be  solved  in  0(n)  time 
with  0(n^)  preprocessing  and  space  by  either  of  the  algorithms  given  above.  Be¬ 
cause  the  parallel  and  perspective  view  problems  are  equivalent  under  a  projective 
transformation,  we  can  map  one  problem  into  the  other  before  applying  the  algo¬ 
rithm.  This  improves  the  best  previously  known  algorithm,  due  to  Edelsbrunner, 
Overmars,  and  Wood,  which  uses  0(n)  search  time,  0(n^  log  n)  preprocessing  time, 
and  0(n^  log  n)  space  [EOW83]. 


Figure  2.8.  A  parallel  view 

(4)  A  lower  bound  on  the  visibility-polygon  search  problem 

There  is  no  algorithm  that  uses  0{n)  time  and  space  (including  preprocessing 
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steps)  to  find  the  visibility  polygon  from  a  point  for  h  disjoint  polygons  with  n  edges. 
We  can  botind  the  time  complexity  of  this  problem  from  below  by  Q(n  +  h  log  h). 

We  first  show  that  the  problem  of  sorting  h  positive  integers  can  be  transformed 
in  linear  time  into  the  problem  of  finding  the  visibility  polygon  from  a  point  for  a  set 
of  h  polygons.  Let  X  =  {ri,a;2, . . .  ,x;i}  be  a  set  of  h  distinct  positive  integers.  For 
each  Xi,  construct  a  triangle  with  vertices  pa,-  =  (2r<  +  2, 0),  psi+i  =  (2ar,-  + 1,  r<)  and 
P3i^2  =  —^i)  for  1  <  i  <  h.  Consider  the  problem  of  finding  the  visibility 

polygon  for  these  triangles  from  the  point  q  =  (0,0).  Since  the  slope  y/(2y  +  1) 
of  the  line  connecting  q  and  the  point  (2y  +  l,y)  with  y  >  0  is  strictly  increasing 
with  y,  the  points  pzi+i  for  1  <  i  <  h  are  visible  from  q.  We  can  obtain  the  sorted 
list  of  the  set  X  in  0{h)  time  once  we  have  the  visibility  polygon  from  q.  Prom 
the  preceding  argument  and  the  fact  that  it  takes  f2(n)  time  to  reaxl  the  polygon 
coordinates,  we  have  the  following  theorem: 

Theorem  2.2.  In  computation  models  where  Q(h  log  h)  is  a  lower  bound  for  sort¬ 
ing  h  integers,  il(n  +  h  log  h)  is  a  lower  botmd  on  the  time  complexity  of  finding  the 
visibility  polygon  hrom  a  point  for  h  disjoint  polygons  with  n  edges  altogether. 

Any  algorithm  that  can  find  the  visibility  polygon  for  a  set  of  n  non-intersecting 
segments  in  0(n)  time  requires  Q{n  log  n)  preprocessing  time.  Our  algorithms  find 
the  visibility  polygon  in  0(n)  time  with  0(n*)  preprocessing  time  and  space.  It  is 
a  challenging  open  problem  to  devise  an  algorithm  that  finds  the  visibility  polygon 
from  an  arbitrary  point  in  0(n)  time  with  o(n^)  preprocessing  time. 

In  the  case  where  all  the  polygons  are  convex,  we  can  obtain  an  £dgorithm  that 
meets  the  lower  botmd.  Consider  the  problem  of  finding  the  visibility  polygon  from  a 
point  for  a  set  of  h  (not  necessarily  convex)  disjoint  polygons  with  n  edges.  We  first 
compute  the  visible  portion  of  the  boundary  of  each  polygon  from  the  point  using 
linear-time  algorithms  [EA81],  [Lee83].  The  result  is  a  sequence  of  edges  from  each 
polygon.  Then  we  decompose  each  sequence  into  maximaJly  continuous  portions, 
where  two  consecutive  edges  on  the  visible  portion  are  continuous  if  they  were 
originally  consecutive.  Let  M  be  the  total  number  of  such  maximally  continuous 
portions.  (Note  that  no  two  such  portions  intersect.)  We  can  apply  a  circular  plane 
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sweep  method  to  find  the  visible  portions  by  sorting  2M  endpoints  in  the  order  of 
their  polax  angles  with  q  at  the  origin. 

The  algorithm  outlined  above  runs  in  0(n  +  M  log  h)  time  in  the  worst  case. 
Since  M  may  be  f2(n),  the  worst-case  time  complexity  is  Ct(n\ogh).  On  the  other 
hand,  if  every  polygon,  like  a  convex  polygon,  has  a  constant  nrunber  of  maximally 
continuous  portions,  then  the  algorithm  requires  only  0(n  -H  h  log  h)  time  and  is 
optimal  within  a  constant  factor. 

It  is  not  known  whether  the  above  approach  can  be  extended  to  give  an  algorithm 
that  finds  the  visibility  polygon  in  0{n)  time  with  0(n  -1-  hlog  h)  or  even  0(n  +  h^) 
preprocessing. 


2.6  The  visibility  graph  and  the  shortest-pat h 
problem 

The  visibility  graph  of  n  non-intersecting  segments  with  arbitrary  slopes  is  a  graph 
whose  vertices  are  endpoints  of  those  segments  and  whose  edges  axe  the  straight 
line  segments  joining  vertices  that  axe  visible  from  each  other.  This  graph  can  be 
constructed  in  O(n^)  time  and  space  by  solving  the  visibility  problem  from  each 
vertex  for  the  given  segments.  This  improves  the  0(n^  log  n)  time  bound  due  to 
Lee  [Lee78]  and  matches  the  O(n^)  result  of  Welzl  [Wel85].  (Note  that  in  the  worst 
case,  the  visibility  graph  has  edges.)  As  an  application  of  this  result,  the 

shortest  path  between  two  points  in  the  plane  with  polygonal  obstacles  having  n 
edges  can  be  computed  in  O(n^)  time,  improved  fi:om  0{n^  log  n),  since  the  problem 
can  be  solved  in  O(n^)  time  by  Dijkstra’s  algorithm,  provided  that  the  visibility 
graph  is  available. 

It  is  instructive  to  compare  Welzl’s  O(n^)  visibility  graph  construction  with 
the  algorithms  presented  here.  He  transforms  endpoints  into  lines  via  the  dual 
construction  and  computes  their  arrangement  in  O(n^)  time,  just  as  we  do.  For 
each  endpoint  pi,  he  uses  the  arrangement  to  sort  the  other  endpoints  by  the  slopes 
of  the  lines  they  determine  with  p,-.  Our  algorithms  use  this  information  to  find  the 
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visibility  edges  from  each  p,-  independently  of  the  other  points.  In  particular,  our 
second  algorithm  sweeps  the  other  points  with  a  scanning  ray  from  each  segment 
endpoint. 

Welzl’s  algorithm  is  a  global  one;  it  scans  the  directions  from  0  to  tt  once, 
maintaining  parallel  scanning  rays  from  all  segment  endpoints.  This  is  equivalent 
to  sorting  the  pairs  of  endpoints  by  the  slopes  they  determine  and  running 
through  the  list  once.  Welzl’s  method  keeps  track  of  the  first  segment  encountered 
by  the  ray  from  each  point.  When  a  sweeping  ray  (with  origin  p)  hits  a  point  q,  it  is 
easy  to  update  p’s  visible  segment.  Let  p’s  current  visible  segment  be  s  and  let  t  be 
the  segment  with  q  as  endpoint.  If  t  starts  at  q  and  obscures  s,  then  t  becomes  p’s 
new  visible  segment.  If  s  =  t,  then  s  ends  at  g;  p’s  new  visible  segment  is  g’s  current 
one. 

As  we  have  described  it,  this  algorithm  requires  sorting  the  endpoint  pairs  by 
slope.  However,  Welzl  observes  that  the  list  of  pairs  does  not  need  to  be  completely 
sorted;  the  method  still  succeeds  as  long  as  every  two  pairs  that  contain  only  three 
distinct  points  axe  correctly  ordered.  The  slope-sorted  list  of  lines  through  each  p, 
obtained  in  the  first  step  correctly  orders  all  endpoint  pairs  that  include  p,.  In 
0(n*)  time  Welzl  produces  a  satisfactory  order  of  all  pairs  by  applying  topological 
sort  to  the  N  ordered  lists. 

Welzl’s  algorithm  suggests  an  open  problem.  He  shows  that  the  visibility  graph 
application  does  not  require  sorting  the  directions  determined  by  N  points. 
Nonetheless,  that  problem  is  interesting  and  imsolved:  can  such  a  sorted  list  be 
produced  in  o(n^  log  n)  time? 

Our  algorithms  require  0(n^)  time  to  compute  the  visibility  graph,  but  the 
structure  they  construct  may  be  as  small  as  0(n),  as  in  Figure  2.9.  In  such  situ¬ 
ations  it  would  be  desirable  to  compute  the  visibility  graph  in  time  proportional 
to  its  size,  or  at  least  in  o(n^).  It  is  not  known  whether  this  is  possible  in  general; 
however.  Chapter  5  gives  an  almost-optimal  algorithm  for  the  special  case  in  which 
the  segments  form  a  simple  polygon. 

For  the  case  where  the  obstacles  are  h  convex  polygons  with  a  total  of  n  edges, 
several  shortest-path  algorithms  that  make  use  of  the  convexity  of  the  polygons 
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Figure  2.9.  A  visibility  graph  with  0(n)  edges 

are  known.  Rohnert  [Roh85]  showed  that  in  this  case  the  shortest-path  problem 
between  two  arbitrary  query  points  can  be  solved  in  0{h^  +  nlogn)  time  with 
0(n  +  h^logn)  preprocessing  time  and  0(n  +  h^)  space  (of  course,  h  can  be  ^(n), 
and  thus  the  time  complexity  is  fi(n^  log  n)  in  the  worst  case).  His  edgorithm  is 
based  on  the  fact  that,  when  the  polygonal  obstacles  are  convex,  shortest  paths  use 
edges  of  the  polygons  or  supporting  lines  of  pairs  of  polygons.  Based  on  this  fact, 
we  can  devise  a  shortest-path  algorithm  that  takes  0(hn  log  n)  time  and  0(n)  space 
(a  similar  result  appears  independently  in  a  paper  by  Papadimitriou  [Pap85]).  It  is 
not  known  whether  the  shortest-path  problem  in  this  case  can  be  solved  in  0{n+h^) 
time  and  space. 

By  using  our  algorithms,  we  can  also  find  the  shortest  path  between  two  segments 
in  the  plane  with  polygonal  obstacles  in  O(n^)  time.  (Here,  by  a  path  between  two 
segments,  we  mean  a  path  between  a  point  on  one  segment  and  a  point  on  the  other.) 
The  shortest  path  between  the  source  and  target  segments  may  start  or  end  with 
edges  connecting  interior  points  of  the  segments  to  obstacle  vertices.  If  we  augment 
the  visibility  graph  by  including  an  edge  corresponding  to  the  shortest  obstacle¬ 
avoiding  segment  from  each  obstacle  vertex  to  the  source  and  target  segments,  then 
we  can  proceed  as  above.  These  additional  edges  can  easily  be  computed  in  O(n^) 
total  time  as  part  of  the  visibility  computation  from  each  vertex. 
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2.7  Concluding  remarks 

In  this  chapter  we  have  given  two  algorithms  to  compute  the  visibility  polygon 
from  a  point  q  for  a  set  of  n  segments  allowed  to  intersect  only  at  their  endpoints. 
Both  algorithms  use  O(n^)  preprocessing  and  storage  to  construct  the  arrangement 
of  lines  corresponding  to  the  segment  endpoints  via  a  dual  transformation.  With 
this  arrangement,  the  algorithms  require  linear  time  to  find  the  polar  order  of  the 
segment  endpoints  around  the  query  point  q.  The  first  algorithm  uses  0(n  log  n) 
preprocessing  time  to  triangulate  the  set  of  segments  S ;  the  triangulation  allows  it 
to  compute  an  order  compatible  with  the  visibility  relation  -<,  in  linear  time.  It 
then  uses  the  Gabow-Tarjan  linear  set-union  algorithm  to  find  the  visibility  polygon. 
The  second  algorithm  splits  each  segment  in  Sq  at  the  polar  angle  indexed  by  the 
least  common  ancestor  of  the  indices  of  its  endpoints.  The  regularized  segments 
thus  produced  are  processed  by  a  scardine  algorithm  to  find  the  visibility  polygon. 

Our  algorithms  can  be  used  to  construct  the  visibility  graph  of  disjoint  poly¬ 
gons  with  n  edges  in  0{n^)  time  and  space.  This  leads  to  an  0(n^)  algorithm 
for  finding  the  shortest  obstacle-avoiding  path  between  two  points  in  the  plane. 
There  axe  several  possible  ways  to  improve  euid  extend  this  shortest  path  £ilgo- 
rithm.  One  extension  is  presented  in  the  following  chapter,  which  uses  the  same 
tools  as  this  chapter — the  visibility  graph  and  Dijkstra’s  algorithm — to  find  short¬ 
est  paths  for  a  non-rotating  convex  object.  By  talcing  an  alternative  approach, 
Reif  and  Storer  [RS85]  are  able  to  solve  the  shortest  path  problem  for  a  point  in 
0(n  log  n  ■+■  nk)  time,  where  k  is  the  number  of  polygons  formed  by  the  obstacle 
segments.  Given  a  fixed  source,  their  algorithm  builds  a  structure  that  can  be  used 
to  find  the  length  of  the  shortest  path  from  the  source  to  an  arbitrary  query  point 
in  O(logn)  time.  The  path  itself  can  be  found  in  additional  time  proportional  to 
the  number  of  turns  along  it.  Chapters  4  and  6  improve  Reif  and  Storer’s  bounds 
when  the  obstacle  segments  form  a  single  simple  polygon;  Chapter  6  shows  how  to 
answer  queries  when  both  endpoints  of  the  path  are  part  of  the  query. 


Chapter  3 

Shortest  Paths  for  a  Non-Rotating 
Convex  Body 


Pooh  Bear  stretched  out  a  paw,  and  Rabbit  pulled  and  pulled  and 

pulled - 

“Ow!”  cried  Pooh.  “You^re  hurting!** 

“The  fact  is,**  said  Rabbit,  “you’re  stuck.** 

“It  all  comes,**  said  Pooh  crossly,  “of  not  having  front  doors  big 

enough.  ** 

—  A.  A.  Milne,  Winnie-the-Pooh  (1926) 


In  the  last  chapter  we  saw  how  to  find  shortest  paths  for  a  point  moving  among 
polygonal  obstacles;  the  algorithm  uses  O(n^)  time,  where  n  is  the  number  of  poly¬ 
gon  vertices.  The  problem  of  finding  shortest  paths  for  a  moving  body  of  finite 
size  is  more  difficult.  In  the  special  case  when  the  body  is  a  non-rotating  convex 
polygon  of  fixed  complexity,  a  simple  extension  of  the  point-motion  algorithms  runs 
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in  0{n^)  time.  Baker  and  Chew  propose  an  O(n^logn)  algorithm  to  find  shortest 
paths  for  a  disk  moving  among  polygonal  obstacles  [Bak85,Che85].  An  extension  of 
Reif  zind  Storer’s  algorithm  (discussed  in  Section  2.7)  gives  an  O(n^)  solution  to  the 
same  problem  [RS85].  Their  method  exploits  the  geometry  of  the  obstacle  space  to 
run  faster  than  O(n^)  under  some  conditions. 

In  this  chapter  we  solve  a  generalization  of  the  disk  motion  problem.  We  show 
how  to  find  shortest  paths  for  a  non-rotating  convex  body.  The  obstacles  we  consider 
are  disjoint  simple  polygons,  as  in  the  work  cited  above;  our  method  is  novel  because 
the  moving  body  is  not  constrained  to  be  polygonal  or  round,  but  only  convex.  The 
arbitrary  shape  of  the  convex  body  is  not  without  its  costs,  but  we  defer  discussion 
of  these  costs  until  Section  3.4. 

Our  approach  uses  two  ideas  from  the  literature  to  simplify  the  problem  of 
finding  shortest  paths.  The  first  simplification  reduces  the  problem  of  moving  a 
non-rotating  convex  object  among  polygons  to  that  of  moving  a  point  (the  object’s 
center)  among  “fattened”  versions  of  the  obstacles.  This  idea  is  due  to  Lozano- 
Perez  zmd  Wesley  [LW79].  The  second  simplification,  due  to  Balcer  and  Chew 
[Bak85,Che85],  reduces  the  problem  of  finding  shortest  paths  for  a  point  among 
fattened  obstacles  to  that  of  finding  a  shortest  path  in  a  graph.  Dijkstra’s  shortest 
path  algorithm  for  graphs  [AHU74,  pages  207-209]  can  be  used  to  solve  the  reduced 
problem. 

The  shortest  path  algorithm  we  present  builds  the  graph  of  Baker  and  Chew, 
simplifies  it,  and  then  invokes  Dijkstra’s  algorithm.  Each  step  takes  O(n^)  time 
and  space.  The  visibility  graph  of  the  polygons,  obtained  using  the  methods  of 
the  preceding  chapter,  forms  the  basis  for  constructing  Baker  and  Chew’s  graph. 
If  the  graph  so  constructed  has  fi(n^)  nodes,  then  running  Dijkstra’s  algorithm  on 
it  will  take  ft(n*logn)  time,  causing  a  bottleneck.  To  circumvent  this  potential 
problem,  the  algorithm  reduces  the  graph  to  an  equivalent  one  with  only  0(n) 
nodes  before  running  Dijkstra’s  algorithm.  Using  the  Fibonacci  heaps  of  Fredman 
and  Tarjan  [FT84],  Dijkstra’s  algorithm  takes  only  0{n^)  time  when  applied  to  the 
reduced  graph. 
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3.1  Definitions 

This  section  gives  the  special  notations  used  in  this  chapter.  It  also  explains  the 
idea  of  “fattening”  obstacles  mentioned  in  the  introduction. 

We  represent  individual  points  by  lowercase  letters  and  sets  of  points  by  upper¬ 
case  letters.  The  segment  between  points  p  and  q  is  pq,  Jind  the  distance  from  p  to  q 
is  \pq\.  This  distance  is  usually  the  straight-line  distance  between  the  points,  though 
we  sometimes  use  to  refer  to  the  distance  from  p  to  q  along  the  boundary  of  a 
convex  region  on  which  both  points  lie. 

We  refer  to  the  moving  convex  body  as  the  robot  and  represent  it  by  the  letter  A. 
The  robot  must  avoid  obstacles  as  it  moves.  These  obst^M:les  are  disjoint  simple 
polygons  with  a  total  of  n  vertices;  the  robot  may  not  touch  the  interior  of  any 
polygon.  (The  polygons  must  be  disjoint  to  satisfy  the  preconditions  of  the  visibility 
graph  algorithms  of  Chapter  2.)  The  set  of  all  polygon  points,  both  vertices  and 
points  on  segments,  is  5. 

The  robot  A  has  a  center,  which  is  just  the  coordinate  origin  in  its  frame  of 
reference.  In  that  frame,  B  is  the  point-wise  reflection  of  A  through  its  center. 
(Note  that  the  center  of  A  need  not  lie  within  its  boimdary.)  We  denote  the  set  of 
points  covered  by  B  when  its  center  is  placed  at  a  point  q  by  (pronoimced  “S 
at  q”). 

To  plan  the  motion  of  the  robot  among  polygons,  we  solve  the  equivalent  problem 
of  moving  the  robot’s  center  among  fattened  versions  of  the  polygons.  To  distinguish 
between  the  obstacles  of  the  original  problem  and  their  fattened  versions,  we  refer 
to  the  polygons  that  the  robot  avoids  as  obstacles  and  to  the  fattened  polygons 
that  the  robot’s  center  avoids  as  barriers.  Points  on  the  boundaries  of  barriers  are 
especially  important  to  our  algorithm;  we  will  refer  to  these  as  boundary  points. 

We  can  draw  the  barriers  by  using  B  as  a  paintbrush,  placing  its  center  at  every 
point  of  the  polygon  boundaries  and  interiors;  the  painted  areas  are  the  barriers  that 
the  robot’s  center  must  avoid.  The  painted  region  is  the  Minkowski  sum  (or  vector 
sum)  of  B  with  the  polygons  and  their  interiors.^  The  botmdziry  of  the  painted  area 


^The  Minkowski  sum  of  two  regions  X  and  Y  consists  of  all  points  expressible  as  a  vector 
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is  closely  related  to  the  convolution  of  the  boundary  of  B  with  the  polygons  in  5, 
as  defined  by  Guibas,  Ramshaw,  and  Stolfi  [GRS83]. 

Any  point  on  the  boimdary  of  a  barrier  must  be  on  the  boundary  of  B^  for 
at  least  one  polygon  point  q,  and  it  may  not  lie  inside  B'^'  for  any  other  q'  in  5. 
Given  a  barrier  boundary  point  p,  we  refer  to  a  polygon  point  q  such  that  p  is 
on  the  boundary  of  B^  as  a  generator  of  p  and  denote  the  set  of  generators  of  p 
by  sr(p).  Unless  the  boundary  of  B  contains  a  segment  parallel  to  one  of  the  polygon 
segments,  the  number  of  generators  |y(p)|  is  finite  for  any  point  p  on  the  boundary 
of  a  barrier.  In  fact,  of  those  boundary  points  p  with  finitely  many  generators,  only 
a  finite  number  have  |y(p)|  >  1.  (We  sometimes  give  the  term  generator  a  slightly 
broader  meaning;  we  say  that  q  generates  the  barrier  B^.) 

We  can  classify  each  barrier  boundary  point  p  by  its  generator  set  g{p).  We  use 
the  term  arc  to  denote  a  maximal  connected  set  of  boundary  points  generated  by 
a  single  polygon  vertex.  The  barrier  boundaries  are  composed  of  arcs,  of  straight 
segments  whose  generators  all  lie  on  a  single  polygon  segment,  and  of  intersection 
points  of  these  elements.^  Each  arc  copies  a  portion  of  the  border  oi  B.  If  an  obstacle 
polygon  is  convex,  then,  in  the  fattened  polygon,  each  arc  bridges  the  difference  in 
slopes  between  the  segments  that  precede  and  follow  it.  (See  Figure  3.1.)  When 
two  arcs  or  segments  intersect  other  than  by  abutment,  the  point  of  intersection 
has  multiple  generators.  (An  arc  and  its  adjacent  segment  intersect  at  their  point 
of  abutment,  but  that  point  has  only  a  single  polygon  vertex  as  its  generator.) 


3.2  The  path  graph 

Since  combinatorial  problems  are  often  easier  to  solve  than  geometric  ones,  we 
reduce  the  shortest  path  problem  to  a  combinatoried  problem  by  introducing  the 

sum  X  +  y,  where  x  &  X  and  y  £  Y.  The  sum  is  symmetric  in  its  arguments — it  is  the  set 
{ar  +  y  I  *  6  A,  y  €  Y} — but  it  can  adso  be  viewed  asymmetrically  as  (J* 

^These  definitions  can  break  down  if  the  boundary  of  B  contains  a  segment  parallel  to  one  of  the 
polygon  segments.  In  this  case,  boundary  arcs  and  segments  may  not  be  disjoint.  To  remedy  the 
problem,  we  use  the  technique  of  c-perturbation:  we  define  the  boundary  arcs  and  segments  as  if  the 
polygon  segment  were  not  parallel  to  the  flat  spot  on  5,  but  rotated  clockwise  by  an  infinitesimal 
amount  6.  This  results  in  a  consistent  definition  of  disjoint  arcs  and  boundary  segments. 
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Figure  3.1.  When  a  convex  obstacle  polygon  is  fattened  by  the  inverted 
robot  B,  the  outer  boundary  of  the  resulting  barrier  has  arcs  and  seg¬ 
ments.  The  tangent  at  each  of  an  arc’s  endpoints  matches  the  slope 
of  the  adjacent  segment.  The  tangent’s  direction  varies  monotonically 
between  these  two  extremes  along  the  arc. 

path  graph.  Our  path  graph  is  a  combinatorial  graph  structure  obtained  from  the 
positions  of  the  barriers  and  is  an  extension  of  the  one  used  by  Baker  and  Chew. 
Each  path  graph  edge  corresponds  to  a  path  among  the  barriers.  We  show  that 
except  for  its  initial  and  final  segments,  any  barrier-avoiding  shortest  path  is  a 
subset  of  the  path  graph  edges. 

The  path  graph  of  a  set  of  barriers  is  closely  related  to  the  visibility  graph  of 
a  set  of  polygons,  which  we  discussed  in  Chapter  2.  The  visibility  graph  records 
pairs  of  mutually  visible  vertices;  similarly,  the  path  graph  records  pairs  of  barriers 
connected  by  common  tangents  that  avoid  all  other  barriers.  Because  shortest  paths 
for  the  robot’s  center  follow  barrier  boundaries  as  well  as  common  tangents,  the 
path  graph  has  two  kinds  of  edges:  portions  of  the  barrier  boundaries  and  tangent 
segments. 

Definition  3.1.  The  nodes  and  edges  of  the  path  graph  are  defined  as  follows: 

(1)  Every  maximal  straight  boundary  segment  generated  by  a  single  polygon  seg¬ 
ment  is  an  edge  of  the  path  graph,  and  its  endpoints  are  path  graph  nodes. 
This  implies  that  when  a  segment  generated  by  a  polygon  segment  intersects 
arcs  or  other  boundary  segments,  each  intersection  is  a  path  graph  node. 
Intersections  of  boundary  arcs  are  also  path  graph  nodes. 
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(2)  If  a  line  tangent  to  two  arcs  does  not  intersect  any  barrier  between  its  points 
of  tangency,  the  two  tangent  points  are  path  graph  nodes,  and  the  segment 
connecting  them  is  a  path  graph  edge.  If  the  tangent  touches  an  arc  at  more 
than  one  point  (the  arc  contains  a  straight  segment),  we  use  the  tangent 
point  that  minimizes  the  length  of  the  tangent  segment.  These  edges  are 
called  tangent  edges. 

(3)  Minimal  arc  sections  connecting  nodes  defined  in  (1)  and  (2)  are  path  graph 
edges  (no  two  such  edges  overlap).  These  edges  are  convex  curves;  some  may 
be  straight  line  segments,  but  only  if  the  boundary  of  B  has  Sat  spots. 

Path  graph  edges  defined  in  (1)  and  (3)  are  called  boundary  edges;  their  tmion 
includes  all  the  boundary  arcs  and  straight  segments. 

Recall  that  the  path  graph  of  a  set  of  barriers  is  the  analogue  of  the  visibility 
graph  of  a  set  of  polygons.  Since  the  visibility  graph  has  0(n*)  edges,  it  is  reasonable 
to  zissume  that  the  path  graph  also  has  0(rP)  nodes  and  edges.  The  following  lemma 
shows  that  this  assumption  is  true. 

Lemma  3.1.  The  path  graph  defined  above  has  0(n^)  nodes  and  edges. 

Proof:  We  begin  by  bounding  the  number  of  path  graph  tangent  edges. 
Consider  centering  a  copy  of  B  at  each  polygon  vertex.  These  n  (pos¬ 
sibly  overlapping)  convex  regions  have  at  most  common  tangents, 
since  two  translated  copies  of  B  have  at  most  four  common  tangents. 
Because  tangent  edges  are  common  tangents  of  boundary  arcs,  and  arcs 
are  generated  by  polygon  vertices,  the  path  graph  tangent  edges  are  a 
subset  of  the  O(n^)  tangents.  Each  tangent  edge  contributes  at  most 
two  endpoint  nodes  to  the  path  graph. 

Kedem  et  al.  [KLPS86]  show  that  there  are  only  0(n)  nodes  formed 
by  intersections  of  segments  and  arcs  on  the  boundary  of  the  barriers. 

(As  in  Figure  3.2,  there  can  be  0(n^)  intersections,  but  only  0(n)  of 
them  are  on  the  boundary.)  These  are  the  nodes  of  type  (1). 
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Figure  3.2.  Although  n  bzirriers  can  intersect  in  O(n^)  points,  only  a 
linear  number  of  the  intersections  are  on  the  boundary  of  the  barriers 

Every  path  graph  node  has  exactly  two  incident  boundary  edges. 

Since  there  are  0(n*)  nodes,  there  are  0(n^)  botindary  edges.  This 
fact,  in  combination  with  the  boimds  on  nodes  and  edges  already  given, 
proves  the  lemma.  | 

An  instance  of  the  shortest  path  problem  specifies  the  initial  and  final  positions 
a  and  u  of  A’s  center,  chosen  so  that  A*  and  A"  avoid  the  interiors  of  the  polygons. 
Shortest  paths  eonong  the  fattened  polygons  begin  and  end  with  edges  &om  a  and  a;, 
but  intermediate  edges  come  from  the  path  graph. 

In  order  to  use  Dijkstra’s  algorithm  to  find  the  shortest  path,  we  need  a  graph 
whose  edges  contain  all  shortest  paths  from  a  to  a;.  We  produce  such  a  graph, 
which  we  call  the  augmented  path  graph,  Ga,  by  adding  nodes  and  edges  to  the 
path  graph.  If  the  segment  aiJ  does  not  intersect  any  barrier,  it  is  the  shortest  path 
and  belongs  in  C?*.  If  aw  is  blocked,  we  add  to  the  path  graph  all  unobstructed 
segments  passing  through  a  or  w  and  tangent  to  some  barrier.  The  endpoints  of 
the  added  segments  augment  the  set  of  path  graph  nodes.  Some  boundary  edges 
are  split  in  two  by  these  additional  nodes.  Because  there  are  at  most  4n  tangents 
from  a  and  w  to  barriers,  the  added  nodes  and  edges  leave  the  augmented  path 
graph  still  of  size  O(n^).  The  following  lemma  shows  that  Ga  is  useful  for  motion 
planning. 

Lemma  3.2.  Every  shortest  path  from  a  tout  that  avoids  the  barriers  follows  edges 
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of  the  augmented  path  graph,  Ga- 

Proof:  Any  shortest  path  is  composed  of  subpaths  that  alternately 
follow  barrier  boundaries  and  move  along  straight  lines  between  barriers. 
If  the  endpoints  of  the  segments  that  connect  barriers  are  augmented 
path  graph  nodes,  then  the  boundziry  subpaths  are  made  up  of  botmdary 
edges.  Thus  it  suffices  to  show  that  every  non-boimdary  segment  on  the 
shortest  path  is  an  edge  of  the  augmented  path  graph. 

Suppose  that  a  segment  ab  lies  on  a  shortest  path  from  a  to  u, 
touches  barriers  only  at  its  endpoints,  and  is  not  an  edge  of  the  aug¬ 
mented  path  graph.  At  least  one  of  aft’s  endpoints,  say  ft,  lies  on  a  barrier 
boundary  and  is  not  a,  u>,  or  a  point  of  tangency  of  aft  with  the  barrier. 
Because  ft  is  not  a  or  cj,  the  path  continues  beyond  ft.  Some  point  d  on 
the  continuation  must  be  visible  from  a  point  c  on  aft,  since  aft  is  not 
tangent  at  ft.  But  this  means  that  the  supposed  shortest  path  could  be 
shortened  by  replacing  the  subpath  from  c  to  d  via  ft  by  the  segment  cd, 
which  contradicts  the  assumption  that  aft  lies  on  a  shortest  path.  (See 
Figure  3.3.)  | 


Figure  3.3.  Barrier-connecting  segments  on  a  shortest  path  must  be 
tangents.  The  dark  path  through  a,  ft,  and  d  cannot  be  a  subpath  of 
any  shortest  path.  Because  ft  is  not  a  tangent  point  of  aft,  the  point  d 
on  the  continuation  of  the  path  is  visible  from  c.  Taking  the  shortcut  cd 
gives  a  path  shorter  than  the  dark  one. 


CHAPTER  3.  MOVING  A  NON-ROTATING  CONVEX  BODY 


39 


3.3  Tangent-visibility 

This  section  deepens  the  connection  between  the  path  graph  of  a  set  of  barriers  and 
the  visibility  graph  of  the  polygons  that  generate  the  barriers.  It  shows  that  the 
path  graph  tangent  edges  from  an  arc  are  closely  related  to  the  visibility  graph  edges 
from  the  vertex  that  generates  the  arc.  The  next  section  exploits  this  connection 
to  construct  the  path  graph  quickly. 


Figure  3.4.  The  visibility  polygon  Vis(S,g)  of  query  point  q  is  the 
region  visible  from  g  in  the  presence  of  the  polygonal  obstacles  in  5. 

The  circular  sequence  of  points  of  S  visible  from  q  is  VP(g). 

If  we  are  given  a  set  of  simple  polygons  with  n  vertices  and  a  query  point  g  outside 
the  polygons,  the  (uncountably  many)  polygon  points  visible  from  g,  listed  in  polar 
order,  form  a  cyclic  sequence  that  we  denote  by  VP(g).  The  points  of  VP(q)  lie  on 
the  boundary  of  the  visibility  polygon  Vis(S,g).  (See  Figure  3.4.)  We  can  define 
a  similar  concept  for  the  fattened  polygons  by  sweeping  them  with  a  ray  tangent 
to  whose  endpoint  moves  along  the  botmdary  of  B^.  If  the  tangent  point  of  the 
ray  is  not  inside  a  barrier  that  overlaps  5’,  we  say  that  the  first  boundary  point 
encountered  by  the  ray  is  tangent-visible  from  5*.  See  Figure  3.5  for  an  example. 
Each  query  point  q  has  two  sequences  of  tangent- visible  points,  since  each  boundary 
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point  of  B  has  both  a  clockwise  and  a  counterclockwise  tangent.  Both  sequences 
may  contain  points  of  tangency  that  are  path  graph  nodes;  since  the  two  sequences 
are  similar,  we  discuss  only  the  sequence  derived  from  the  clockwise  tzingent.  Let 
us  denote  the  sequence  of  barrier  boundary  points  swept  by  the  clockwise  tangent 
to  B^  by  TV(g).  The  generators  of  the  points  in  TV(q)  form  a  sequence  we  call 
g{TV(q)).  (If  a  point  p  in  TV(q)  has  multiple  generators,  any  one  of  them  may 
appe2ir  in  the  generator  sequence.) 


Figure  3.5.  Tangent-visibility.  The  darkened  section  of  the  boundary 
of  B^  and  J5’’  is  tangent- visible  from  B^. 

To  help  characterize  the  sequence  of  generators  g(  TV (g)),  we  introduce  some 
special  notation.  As  the  tangent  to  B’  sweeps  aroxmd,  it  points  in  each  direction 
between  0  and  2-k  exactly  once.  Let  p  be  a  point  and  T  a  set  of  points.  The  point  p 
generates  the  barrier  B^,  and  T  generates  barriers  that  are  the  Minkowski  smn  of  T 
and  B.  We  use  the  notation  to  refer  to  the  set  of  directions  in  [0,27r)  for 

which  the  tangent  from  B^  hits  B^  strictly  before  hitting  any  other  barrier  B’’  for 
r  ^  p  and  r  in  T.  (It  doesn’t  matter  whether  p  is  in  T  or  not.)  In  this  notation 
v^{q,p)  is  an  interval  (allowing  wraparound  across  27r)  of  measure  less  than  v.  The 
set  VT(q,p)  shrinks  as  T  grows;  that  is,  uru{r}(?»p)  Q  for  any  r.  The  set 

vs{q,p)  can  be  expressed  as  flrgs  V{r}(9>p)*  For  convenience,  we  let  Ur(9>p)  stand  for 
v^r}(<l,p)•  (See  Figure  3.6  for  an  example  of  the  notation.)  The  following  lemmas 
apply  this  notation  to  show  that  the  generators  of  the  tangent-visible  sequence 
TV{q)  are  a  subsequence  of  VP(q). 
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Figure  3.6.  Tangent- visibility  notation.  When  no  other  barriers  are 
present,  is  tangent-visible  from  in  the  shaded  sector  shown  in 
(a).  The  directions  of  the  rays  that  bound  the  sector  define  the  in¬ 
terval  vt(q,p).  When  barriers  B^  and  B*  are  added,  the  sector  of  tan¬ 
gent-visibility  is  reduced  to  the  shaded  region  shown  in  (b).  This  sector’s 
boundary  rays  define  V{r,*}(9>p)* 

Lemma  3.3.  If  p*  is  a  point  of  the  tangent-visible  sequence  TV(q),  all  of  its  gen¬ 
erators  are  in  VP(q). 

Proof;  Let  p  be  a  generator  of  p',  that  is,  p  €  ff(p')'  ^  P  ^  *•- 

polygon  point  r  lies  on  the  segment  connecting  p  and  q.  The  object  B' 
hides  every  point  of  S'*  from  the  sweeping  tangent,  as  shown  in  Fig¬ 
ure  3.7.  Because  Vr(q,p)  =  0,  its  subset  vs(q,p)  is  also  empty,  and  p' 
cannot  be  in  TV(q).  | 


Lemma  3.4.  For  any  set  of  barrier  generators  T,  VT(q,p)  is  an  interval. 

Proof:  Since  the  intersection  of  intervals  of  length  less  than  ir  results 
in  an  interval,  we  need  only  show  that  Vr(q,p)  is  an  interval  for  all  r. 
If  Vr{q,p)  is  not  an  interval  for  some  r,  then  let  a  <  5  <  c  be  direc¬ 
tions  in  the  interval  Vi{q,p)  such  that  a  and  c  are  in  Vr{q,p)  and  6  is 
not.  (We  have  assumed  without  loss  of  generality  that  v^{q,p)  does  not 
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Figure  3. 7.  hides  B^  from  B^ 

include  27r.)  In  Figure  3.8,  the  segments  UaVa,  up?,  and  xT^  have  di¬ 
rections  a,  6,  and  c.  Because  directions  a  and  c  are  part  of  Vriji,  P)i 
barrier  B^  cannot  intersect  the  segments  and  even  at  their 
endpoints. 


Figure  3.8.  Proof  that  Vr(q,p)  is  an  interval.  S’”  cannot  intersect 
(block  teingent-visibility  in  direction  b)  without  also  intersecting  either 
or  u^. 

We  show  that  B^  cemnot  intersect  and  hence  b  is  in  Vr{q,p). 

Consider  all  possible  placements  of  J3’’  that  intersect  the  shaded  region 
of  Figure  3.8.  Note  that  every  placement  that  abuts  the  region  without 
overlapping  its  interior  touches  UaVa  or  u^.  Since  the  shaded  region  is 
narrower  than  the  parallelogram  defined  by  the  outer  common  tangents 
to  and  B^,  any  placement  of  B^  that  overlaps  the  shaded  region’s 
interior  must  also  intersect  UaVa  or  u^.  Because  ujuj  is  contained  in 
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the  shaded  region,  B’’  CEinnot  intersect  it.  | 

Lemma  3.5.  If  points  p,  r,  and  s  belong  to  the  sequence  g(TV(g))  and  appear 
in  clockwise  order  around  q,  then  the  intervals  vs{q,p),  vs{q,r),  and  vs(q,s)  are 
disjoint  and  also  appear  in  clockwise  order. 

Proof:  The  interveJs  vs(q,p),  vs(q,r),  and  vs(q,s)  are  disjoint  because 
p,  r,  and  s  all  belong  to  S. 

The  ray  from  g  to  p  is  parallel  to  the  outer  common  tangents  from 
to  S’*.  Let  pt  (t  for  tangent)  be  the  direction  of  this  ray,  and  let  p„  (v  for 
visible)  be  any  direction  in  vs(q,p)>  We  define  r<,  r„,  and  similarly. 
These  six  directions  occmr  in  some  clockwise  order,  though  there  may 
be  degeneracies.  If  pt  =  Pv*  we  order  pt  and  pv  such  that  pv  immediately 
follows  Pt  in  clockwise  order.  If  p„  =  rt,  we  order  p„  and  r*  such  that  r* 
immediately  follows  pv  in  clockwise  order.  (The  first  condition  applies  to 
r  and  s  as  well,  and  the  second  applies  to  all  pairs  taken  firom  {p,  r,  s}.) 

See  Figure  3.9  for  an  example  of  this  notation.  We  want  to  show  that  if 
Pt,  rt,  and  St  occur  in  clockwise  order,  so  do  pv,  r„,  smd 


Figure  3.9.  The  tangent  directions  pt,  rt,  and  St  and  the  visibility  di¬ 
rections  p„,  r„,  and  The  cyclic  permutation  associated  with  these 
directions  is  (pt  rt  Pv  r^,  st  s^). 
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The  following  simple  condition  on  the  clockwise  sequence  of  direc¬ 
tions  is  stated  using  p  and  r,  but  it  applies  to  all  pairs  drawn  from 
{p,-r,  s}:  At  most  one  of  rt  and  r„  occurs  after  pt  and  before  Pv.  If 
(pt  ft  r„p„)  is  a  subsequence  of  the  cyclic  sequence  of  angles,  then  Vr{q,p) 
is  not  an  interval,  since  it  includes  both  pt  and  p„,  but  not  r^,  which  lies 
between  them.  This  contradicts  Lemma  3.4.  If  (pt  r„  rt  Pv)  is  a  subse¬ 
quence  of  the  cyclic  sequence  of  angles,  then  at  least  one  of  v^{p)  and 
v^{r)  is  larger  than  x,  also  a  contradiction. 

Of  the  possible  angle  sequences,  the  only  ones  that  satisfy  the  condi¬ 
tion  also  have  Pv,  r„  and  s„  in  order.  There  are  120  cyclic  permutations 
on  {p«,  Pv,  rt,  r„,  St,  s„}.  Of  these,  60  have  pt,  rt,  and  St  in  clockwise  order. 
Of  the  60,  only  eleven  meet  the  condition  given  above;  in  all  of  those,  pv. 
Tv,  and  3„  appear  in  clockwise  order.  If  we  class  together  permutations 
that  are  equivalent  under  renaming  of  p,  r,  and  s,  there  are  really  only 
five  different  permutations  that  satisfy  the  condition.  They  are  given  in 
Figure  3.10.^  This  proves  that  the  tangent-visible  points  generated  by 
p,  r,  and  s  occtir  in  the  same  clockwise  order  as  p,  r,  and  s  themselves. 

I 

Lemmas  3.3  and  3.5,  taken  together,  have  the  following  consequence: 

Theorem  3.1.  The  sequence  g{TV{q))  is  a  subsequence  of  VP(q). 


3.4  Construction  of  the  augmented  path  graph 

Since  any  shortest  path  from  a  to  a?  follows  edges  of  the  augmented  path  graph, 
our  first  step  in  finding  a  shortest  path  is  to  construct  that  graph.  The  graph  has 
two  kinds  of  edges,  boundary  edges  and  tangent  edges,  which  we  find  separately. 
(We  need  find  only  the  edges,  since  the  nodes  of  the  graph  are  given  by  edge 
intersections.) 

®By  an  elegant  geometric  argument,  Jim  Saxe  has  shown  that  there  are  no  barrier  configurations 
that  give  rise  to  permutations  of  Class  5.  (Private  communication,  1986.) 
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Class  4  Class  5 


Example  Equivalent  permutations 

Class  1:  (ptPvrtryStSv) 

Class  2:  (pt  Pv  rt  St  s„)  (pt  3„  p„  r*  s*)  (p«  r^,  St  3„) 

Class  3:  (pt  r*  Pv  r„  s^)  (pt  Pv  ^«)  (p<  Pw  ^t) 

Class  4:  (pi  n  s*  Pv  r„  Sv)  (p«  Tv  5v  Pv  r*  5*)  {pt  rt  3^  p„  s«) 

Class  5t  (pt  5t;  r t  Pv  ry) 

Figure  3.10.  Five  essentially  different  cyclic  permutations  satisfy  the 
condition  of  Lemma  3.5.  The  figure  shows  one  example  from  each 
class,  and  the  table  lists  its  equivalent  permutations,  normalized  to  start 
with  Pt. 
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To  find  the  boundary  edges,  we  construct  the  boundzuy  of  the  barriers,  that  is, 
the  border  of  the  region  where  the  robot’s  center  may  be  placed.  We  compute  this 
boundary  using  the  divide-and-conquer  method  of  Kedem  et  al.  [KLPS86],  which 
runs  in  time  O(rnlog^n);  here  r  is  a  constant  that  depends  on  B  and  which  will 
be  defined  later. 

To  find  the  tangent  edges,  we  construct  the  two  tangent-visible  sequences  from 
each  fattened  polygon  vertex  B’’.  The  sequence  TV{q)  and  its  counterclockwise 
counterpart  give  all  path  graph  edges  tangent  to  5*. 

The  results  of  the  preceding  section  provide  a  way  to  produce  the  tangent- visible 
sequence  TV(q)  from  VP{q),  which  we  can  find  using  the  methods  of  Chapter  2. 
After  0(n^)  preprocessing,  those  algorithms  take  0(n)  time  to  compute  the  visi¬ 
bility  polygon  for  any  particular  vertex.  (The  algorithms  require  that  the  obstacle 
segments  be  disjoint,  as  does  the  boundary  construction  of  Kedem  et  al.  mentioned 
above.) 

To  bound  the  time  it  takes  to  construct  TV(q),  we  introduce  some  conditions 
on  B.  These  conditions  are  not  restrictions  on  the  shape  of  B,  but  rather  char¬ 
acterizations  of  it.  Each  of  the  five  operations  listed  below  is  used  in  some  phase 
of  our  shortest  path  algorithm;  to  bound  the  algorithm’s  performance,  we  need  to 
bound  the  complexities  of  the  basic  operations.  We  therefore  assume  that  each  of 
the  following  operations  can  be  performed  in  time  r,  for  some  r  dependent  on  B: 

To  compute  TV(q)  from  VP{q),  we  must  be  able  to 

(1)  Find  intersection  points  of  two  translated  copies  of  B, 

(2)  Find  the  intersection  of  a  line  with  B,  and 

(3)  Compute  the  inner  and  outer  common  tangents  of  two  translated  copies 
ofB. 

To  compute  the  boundary  points  visible  from  the  source  and  destination 
points  a  and  w,  we  must  be  able  to 

(4)  Find  the  tangents  to  B  through  a  point. 
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To  compute  the  lengths  of  path  graph  edges  quickly,  we  need  to 

(5)  Compute  the  perimeter  distance  between  two  arbitrary  points  on  an  arc 
ofB. 

(Note  that  preprocessing  of  B  may  be  used  to  speed  up  distance  computa¬ 
tions.) 

Because  of  Theorem  3.1,  we  are  able  to  compute  TV{q)  using  an  algorithm  sim¬ 
ilar  in  spirit  to  the  Graham  scan  convex  hull  algorithm  [Gra72].  Roughly  speaking, 
we  replace  each  vertex  and  segment  in  VP{q)  by  the  barrier  it  generates,  then  find 
the  tangent-visible  points  on  the  imion  of  these  barriers.  Each  visibility  polygon 
vertex  p  contributes  B^  to  the  tmion,  and  each  segment  pf  in  VP(q)  contributes  a 
tube  U«€pr  •S*j  which  we  represent  compactly  by  B^.  (See  Figure  3.11.) 


Figure  3.11.  The  tube  B’^  and  its  internal  parallelogram 

It  is  convenient  to  replace  the  tube  by  the  parallelogram  defined  by  the 
endpoints  of  the  outer  common  tangents  of  B^  and  B^.  This  choice  removes  a 
possible  degeneracy  from  the  construction.  Each  arc  is  generated  just  once,  by 
a  vertex,  rather  than  three  times:  once  by  the  vertex  emd  twice  by  the  vertex’s 
incident  segments.  Computing  the  parallelogram  requires  operation  (3)  above. 

Each  barrier  generated  by  an  element  i  in  VP{q),  either  vertex  or  segment,  has  a 
range  of  angles  in  which  it  is  tangent- visible  from  B’’  if  no  other  barriers  are  present. 
This  range  is  v^{q^  i)  if  i  is  a  vertex;  for  convenience,  we  use  the  notation  v^{q,  i)  to 
represent  the  range  when  i  is  a  segment,  too. 


CHAPTER  3.  MOVING  A  NON-ROTATING  CONVEX  BODY 


48 


If  B'  intersects  J3’,  the  intersection  blocks  all  tangent- visibilities  in  the  angular 
range  it  covers.  The  range  in  which  the  barrier  generated  by  i  blocks  all  visibilities 
is  called  its  blocking  interval  b(i).  If  S*  and  S’  do  not  intersect,  b(i)  is  empty.  For 
clockwise-going  tangents  to  S’,  the  intersection  of  S‘  and  S’  is  clockwise  of  the 
tangent- visible  part  of  S*.  The  blocking  interval  b(i)  immediately  follows  vt(q,i)  in 
clockwise  order. 

When  more  than  one  barrier  is  present,  each  barrier  S‘  is  visible  in  some  subin¬ 
terval  of  Vi(g, p).  An  interval  in  which  a  barrier  S*  is  tangent- visible  may  be  followed 
in  clockwise  order  by  a  blocking  interval.  If  two  blocking  intervals  b{i)  and  b{j)  over¬ 
lap,  they  can  be  merged  into  a  single  joint  blocking  interval.  (See  Figure  3.12.)  We 
associate  this  joint  interval  with  the  tangent-visible  barrier  S’  that  precedes  it  in 
clockwise  order;  note  that  the  blocking  interval  5(i)  contributes  to  the  joint  blocking 
interval. 


Figure  3.12.  The  blocking  intervals  of  B^,  S’"  and  S'  are  merged  into  a 
single  joint  blocking  interval  and  associated  with  S’*. 

Theorem  3.1  states  that  the  generators  of  the  tangent- visible  sequence  TV(q) 
are  a  subsequence  of  VP{qy,  we  apply  the  theorem  to  construct  TV{q)  efficiently. 
Let  L  =  (ii,...,ik)  be  a  clockwise  subsequence  of  VP{q).  Each  element  *  in  L 
has  associated  with  it  a  blocking  interval  b(i)  that  may  be  a  joint  interval  (larger 
than  the  intersection  of  S*  with  S’).  Suppose  that  when  the  barriers  and  blocking 
intervals  given  by  the  elements  of  L  are  present,  every  such  barrier  is  tangent-visible 
from  S’.  Let  j  be  a  successor  of  ik  in  VP{q)',  that  is,  (ti, . . .  ,ikij)  is  a  subsequence 
of  VP{q).  Theorem  3.1  implies  that  if  B^  is  hidden  from  S’  by  barriers  and  blocking 
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intervals  given  by  elements  in  L,  it  is  hidden  by  the  union  of  ,  B’* ,  6(ii ),  and  6(u). 
On  the  other  hand,  if  hides  some  of  the  barriers  generated  by  elements  in  L, 
it  is  visible  itself.  Furthermore,  and  b(j)  hide  a  contiguous  initial  and  final 
subsequence  of  L.  That  is,  if  they  hide  a  part  of  J5®  for  some  x  in  L,  they  also  hide 
all  the  predecessors  of  x  in  X  or  all  the  successors. 

The  algorithm  of  Figure  3.13  exploits  these  observations  to  compute  TV(q).  It 
maintains  a  double-ended  linked  list  L  of  elements  of  VP{q).  Each  element  has 
associated  with  it  an  interval  of  current  visibility  v(i)  and  a  blocking  interval  6(i). 
(As  the  algorithm  runs,  i;(i)  shrinks  and  b(i)  grows.)  The  list  corresponds  to  the 
subsequence  L  described  above.  At  the  top  of  each  for  loop,  the  following  invariant 
holds:  L  contains  a  clockwise  subsequence  of  VP{q)  such  that  when  only  the  barriers 
and  blocking  intervals  given  by  L  are  present,  every  such  barrier  is  tangent- visible 
from  B*.  (At  the  end  of  execution,  L  contains  exactly  the  elements  that  generate 
TV{q).)  During  each  loop,  a  new  element  of  VP{q)  is  inserted  at  the  end  of  L.  The 
operations  necessary  to  restore  the  invariant  affect  only  the  ends  of  the  list. 

The  algorithm  uses  just  one  geometric  primitive:  Given  two  elements  from  the 
visibility  polygon  boundary  i  and  j  and  their  associated  blocking  intervals  b{i) 
and  5(j),  the  primitive  determines  all  tzmgent  visibilities  that  exist  when  only  the 
barriers  B*  and  B^  and  the  blocking  intervals  b{i)  and  6(j)  are  present.  If  the 
barrier  B*  is  not  tangent- visible  under  these  circumstances,  we  say  that  b{j)  and  B^ 
hide  B*.  The  primitive  requires  a  constant  niunber  of  the  tangent-  and  intersection¬ 
finding  operations  (l)-(4),  and  therefore  takes  0(r)  time.  (Operation  (4)  is  needed 
because  we  replaced  B*  by  a  parallelogram  for  each  segment  i  in  VP{q).) 

The  algorithm  runs  in  0{Tn)  time.  Each  execution  of  the  outer  for  loop  takes 
0(r)  time,  exclusive  of  the  time  spent  in  the  two  inner  while  loops.  Each  time  one 
of  those  loops  is  executed,  zm  element  of  L  is  deleted.  No  more  than  n  elements  are 
ever  added  to  X,  so  the  inner  loops  are  executed  at  most  n  times  altogether;  each 
execution  takes  0{t)  time. 

The  algorithm  and  the  preceding  discussion  constitute  a  proof  of  the  following 
lemma. 
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Set  L  to  be  an  empty  double-ended  queue.  The  functions  head{L)  and 
tail{L)  return  the  elements  at  the  ends  of  the  queue, 
for  each  i  in  VP(q)  in  clockwise  order  do 
begin 

Let  6(i)  be  the  interval  given  by  the  intersection  of  B*  and  B^ 
while  5*  and  b{i)  hide  in  the  interval  v{head{L))  do 

begin 

Merge  b(kead{L))  into  6(i); 

Delete  kead{L)  from  L; 
end 

while  JB*  and  6(z)  hide  in  the  interval  v(iail(L))  do 

begin 

Merge  b(tail{L))  into  5(i); 

Delete  iail(L)  from  L; 
end 

Let  U  be  the  set  of  barriers  and  blocking  intervals  B***^^^\ 

b(head{L)),  b{tail{L)),  and  6(*); 

(The  following  test  uses  the  primitive  operation  twice:) 
if  B*  is  tangent- visible  from  in  the  presence  of  U  then 

begin 

Determine  the  endpoints  of  v(i)  by  comparisons  with  U ; 
Adjust  the  endpoints  of  v(head(L))  and  v(tail(L))  if  B*  and 
6(i)  affect  them; 

Insert  i  at  the  tail  of  L; 

(The  barriers  generated  by  t,  head{L),  and  tail{L)  are  all  vis¬ 
ible  and  separate  their  associated  blocking  intervals,  so  no 
blocking  intervals  need  to  be  merged.) 

end 

else  (B*  is  not  teingent-visible) 

Merge  b{i)  into  b(tail(L)); 

end 

Figure  3.13,  An  algorithm  to  compute  TV{q) 
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Lemma  3.6.  If  each  of  the  operations  (1),  (2),  (3),  and  (4)  given  above  can  be 
performed  in  time  r,  then  the  tangent-visible  sequence  TV(q)  can  be  produced 
from  VP(q)  in  0(Tn)  time. 

When  the  algorithm  terminates,  L  contains  g(TV(q)),  the  generators  of  the 
barriers  that  are  clockwise  tangent-visible  from  It  is  easy  to  find  the  path 
graph  edges  that  are  clockwise-tangent  to  5®  given  the  list  L.  For  each  element 
i  €  Lf  if  either  end  of  the  visible  range  v(i)  is  delimited  by  a  common  tzmgent  of 

and  B*,  then  we  add  the  tangent  edge  to  the  path  graph. 

We  can  construct  the  tangent  edges  of  the  path  graph,  but  we  still  need  to  find 
the  edges  from  a  and  u;  that  augment  the  path  graph.  However,  the  ideas  used  in 
Section  3.3  Jind  the  preceding  lemma  apply  to  this  problem  as  well. 

Lemma  3.7.  The  edges  from  a  and  u  that  augment  the  path  graph  can  be  foxmd 
in  0(rn)  time. 

Proof:  Lemmas  3.3  through  3.5  discuss  tangent-visibility  of  barriers. 
Similar  lemmas  are  true  for  visibility  of  barriers  from  a  point.  For  ex¬ 
ample,  if  r  lies  on  the  segment  connecting  a  and  p,  no  point  of  B’’  is 
visible  from  a,  which  proves  a  lemma  analogous  to  Lemma  3.3.  Let 
vt{9,p)  have  the  same  meaning  for  visibility  from  q  as  VT{q,p)  has  for 
tangent- visibility  from  B^.  Then  vx(g,p)  is  an  interval  for  any  T:  as  in 
the  proof  of  Lemma  3.4,  we  consider  %{q,p)  for  any  r  and  show  that  B*" 
cannot  intersect  in  Figure  3.14  without  touching  or 

The  analogues  of  Lemma  3.5  and  Theorem  3.1  follow  directly  from 
the  modified  Lemmas  3.3  and  3.4,  and  the  Graham  scan  of  Lemma  3.6 
is  easily  modified  to  handle  visibility  from  a  point,  given  that  B  satisfies 
condition  (4)  given  above.  | 

This  section  has  described  three  separate  aspects  of  constructing  the  path  graph. 
The  method  of  Kedem  et  al.  finds  the  boundary  of  the  barriers  in  0(nr  log^  n)  time. 
The  algorithm  given  in  Figure  3.13  produces  the  path  graph  tangent  edges  in  O(n^r) 
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Figure  3.14.  Proof  that  vt{9,p)  is  an  interval;  see  Figure  3.8.  cannot 
intersect  qvb  (block  visibility  in  direction  b)  without  also  intersecting 
either  qv^  or  qv^. 

time.  A  variemt  on  the  algorithm,  sketched  in  the  previous  lemma,  finds  the  edges 
from  a  and  u  that  augment  the  path  graph  in  O(nT)  time.  The  endpoints  of  these 
edges,  taken  together  with  the  barrier  boundaries,  give  all  the  botmdary  edges  of 
the  augmented  path  graph.  The  following  theorem  siunmarizes  these  results: 

Theorem  3.2.  The  augmented  path  graph  can  be  constructed  in  O(rn^)  time. 


3.5  Pruning  the  path  graph 

It  is  possible  to  find  a  shortest  path  by  running  Dijkstra’s  algorithm  on  the  aug¬ 
mented  path  graph,  Ga-  However,  because  the  tangent  edges  in  Ga  can  have  Q(n^) 
endpoints  altogether,  the  best  implementation  of  Dijkstra’s  algorithm  known  may 
take  f2(n^logn)  time  in  the  worst  case  [FT84].  To  reduce  the  time  to  0(n%  we 
need  a  graph  with  fewer  nodes.  This  section  and  the  following  section  show  how 
to  produce  a  graph  equivalent  to  Ga  that  has  fewer  nodes.  This  section  deletes 
unusable  tangent  edges  and  their  endpoints;  the  next  section  groups  nodes  together 
and  modifies  edges  to  link  the  groups. 

We  now  show  how  to  eliminate  some  edges  of  Ga  that  lie  on  no  shortest  path 
between  a  and  w;  we  identify  these  edges  using  only  local  tests.  After  deleting  the 
edges  from  Ga,  we  delete  any  resulting  nodes  of  degree  two  and  merge  their  incident 
edges.  The  result  is  a  pruned  path  graph,  Gp. 

We  have  already  noted  that  in  the  path  graph  there  are  both  clockwise  and 
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counterclockwise  tangent  edges  from  a  barrier  arc.  There  are  inner  and  outer  com¬ 
mon  tangent  edges  of  both  types.  Altogether,  the  four  possible  combinations  of 
clockwise/counterclockwse  with  inner/outer  give  four  classes  of  path  graph  edges 
from  each  barrier  arc.  Within  each  class  all  tangents  to  an  arc  are  cyclically  ordered. 
The  following  lemma  shows  how  to  identify  unusable  edges  of  each  claas. 

Lemma  3.8.  Suppose  ci  and  C2  in  Ga  are  consecutive  clockwise  outer  tangents 
going  from  to  barriers  B^^  and  with  p2  to  the  right  of  the  directed  line 
from  q  to  Pi .  If  B^^  and  B^  intersect,  the  continuation  of  cj  intersects  B^\  thus 
defining  a  bay.  (See  Figure  3.15.)  Then  edge  cj  cannot  appear  on  a  shortest  path 
unless  a  or  u)  lies  in  the  bay. 


Figure  3.15.  Because  B^^  and  B^  intersect,  edge  cj  cannot  appear  on 
a  shortest  path  unless  a  or  u  lies  in  the  shaded  bay. 


Proofs  The  continuation  of  C2  must  intersect  B***;  otherwise  it  would 
separate  and  B*^.  If  neither  a  nor  u;  lies  in  the  bay,  any  path  that 
enters  the  bay  can  be  shortened  by  re-routing  it  to  avoid  the  bay.  Any 
path  that  uses  C2  has  no  forward  continuation  (wrapping  around  the 
corner),  because  all  such  continuations  enter  the  bay.  | 
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Similar  conclusions  hold  for  pairs  of  consecutive  edges  from  the  other  three 
classes:  clockwise  inner  tangents,  counterclockwise  outer  tangents,  and  counter¬ 
clockwise  inner  tangents. 

The  preceding  lemma  gives  a  way  of  pruning  the  augmented  path  graph  to 
remove  unneeded  edges.  The  following  lemma  shows  that  the  pruning  can  be  per¬ 
formed  in  O(rn^)  time  overall. 

Lemma  3.9.  All  prunable  edges  from  arcs  generated  by  a  polygon  vertex  q  can  be 
found  and  discarded  in  0(rn)  time. 

Proof:  Each  pinning  test  takes  only  0(r)  time.  Suppose  that  Ci  and  ej 
are  consecutive  edges  of  the  same  class  tangent  to  and  B^.  If  P*** 
and  B’^  intersect,  we  can  find  the  points  a,  b,  and  c  shown  in  Figure  3.16 
using  0(t)  time.  Since  a  and  u  lie  outside  the  barriers,  a  or  u;  lies  in 
the  shaded  bay  if  and  only  if  it  lies  in  the  triangle  Aabc,  which  can  be 
checked  in  constant  time.  If  neither  a  nor  u>  lies  in  Aabc,  then  edge  62 
may  be  removed. 


Figure  3.16.  Point  a  or  u?  is  in  the  bay  if  and  only  if  it  is  in  Aabc.  If 
neither  a  nor  a;  lies  in  Aabc,  then  62  can  be  pruned. 


55 


CHAPTER  3.  MOVING  A  NON-ROTATING  CONVEX  BODY 


To  check  whether  an  edge  needs  pruning,  we  need  only  test  it  against 
edges  of  the  same  class  whose  endpoints  precede  or  follow  it  on  the  same 
arc.  When  an  edge  is  pruned,  two  edges  that  were  not  neighbors  become 
neighbors  and  must  be  tested.  However,  because  there  are  0(n)  path 
graph  edges  of  each  class  from  arcs  generated  by  q,  only  0(n)  tests  are 
needed  for  each  polygon  vertex  q.  | 

If  two  successive  edges  of  the  same  class  go  to  overlapping  translated  copies  of  B 
and  neither  a  nor  to  lies  in  the  bay  reached  by  the  inner  edge  (cj  in  Figxire  3.16),  the 
inner  edge  is  pruned.  In  cases  like  the  one  shown  in  Figure  3.17,  many  unnecessary 
edges  can  be  pruned.  After  pruning,  at  most  two  pairs  of  successive  edges  go  to 
overlapping  barriers.  (Each  unpnmed  inner  edge  defines  a  bay  containing  a  or  u.) 
The  following  lemma,  which  is  used  in  the  next  section,  shows  that  if  barriers 
reached  by  successive  edges  of  the  same  class  do  not  overlap,  neither  do  barriers 
reached  by  non-successive  edges. 


Figure  3.17.  In  this  example,  pruning  removes  many  useless  edges.  Of 
the  edges  shown,  only  the  leftmost  can  appear  on  a  shortest  path,  since 
neither  a  nor  u  lies  in  any  of  the  bays  formed  by  the  other  edges. 


Lemma  3.10.  Let  E  be  a  subset  of  the  edges  in  Gp  that  connect  B’  to  arcs  of 
other  barriers,  chosen  so  that  all  edges  in  E  have  the  same  class.  Let  U  be  the  set 
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of  polygon  vertices  that  generate  the  arcs  reached  by  edges  of  E.  The  angles  of  the 
edges  in  E  give  a  cyclic  ordering  on  the  elements  of  U.  If  B*  is  empty  for  all 
consecutive  generators  p  and  s,  then  B^  fl  B*  is  empty  for  all  distinct  p  and  s  in  U. 

Proof:  Suppose  to  the  contrary  that  B’’’  H  B*  is  non-empty  for  some  p 
and  s  in  U.  Choose  p  and  s  such  that  p  precedes  s  and  the  size  of  the 
subset  of  U  between  p  and  s  is  minimized.  By  hypothesis  this  subset 
contains  some  element  r.  Since  the  common  tangent  from  B^  to  B' 
reaches  a  visible  point,  the  point  of  tangency  is  in  the  shaded  region 
(see  Figure  3.18).  This  implies  that  B'  extends  beyond  (in  Figure  3.18, 
to  the  right  of)  any  line  passing  through  the  intersection  of  B^  and  B* 
and  tangent  to  B’.  To  get  out  of  the  shaded  region  without  overlapping 
B^  or  B*,  B^  must  cross  the  tangent  from  B’  to  B*,  and  hence  that 
edge  cannot  be  in  the  path  graph,  a  contradiction.  | 


3.6  Node  coalescing  in  the  pruned  path  graph 

We  have  shown  that  any  shortest  path  from  a  to  u  follows  edges  of  the  pnmed 
path  graph,  Gp.  Because  Gp  may  still  have  fi(n^)  nodes,  Dijkstra’s  algorithm  may 
require  fl{n^  log  n)  time  when  applied  to  it.  In  this  section,  we  show  how  to  modify 
the  graph  so  that  Dijkstra’s  algorithm  takes  only  0(n^)  time  when  applied  to  the 
result.  Because  nodes  of  Gp  are  coalesced  during  the  modification,  we  call  the 
resulting  graph  the  coalesced  graph,  Gc- 

The  modifications  of  this  section  group  consecutive  nodes  on  the  boundaries  of 
barriers.  To  better  describe  the  modifications,  we  first  characterize  barrier  bound¬ 
aries.  Every  boxmdary  between  a  maximal  connected  barrier  region  zind  the  barrier- 
free  region  is  a  ring  of  path  graph  nodes  and  edges.  (A  barrier  may  have  several 
such  rings  if  it  has  holes.)  Any  subpath  of  a  shortest  path  that  uses  edges  from 
one  of  these  rings  follows  them  in  a  consistent  direction,  either  clockwise  or  coun¬ 
terclockwise.  A  ring  is  entered  and  left  via  tangents  to  it,  2ind  these  connections 
are  consistent  with  the  orientation  of  the  ring.  See  Figure  3.19.  Any  path  in  Ga 
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Figure  3.18.  Neighbor  non-intersection  implies  global  non- intersection. 
The  tangents  from  J3’  to  B'’,  and  B’  occur  in  clockwise  order.  If 
B^  and  B*  intersect,  but  intersects  neither  of  them,  then  B^  blocks 
the  tangent  to  B*. 
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that  traverses  rings  in  a  single  direction  and  connects  to  them  consistently  is  called 
forward-going. 


Figure  3.19.  A  forward- going  subpath 

An  edge  of  the  pruned  path  graph  can  be  traversed  in  either  direction,  since 
the  path  graph  is  undirected.  As  the  first  step  in  forming  the  coalesced  graph,  we 
transform  Gp  into  a  directed  graph,  replacing  each  imdirected  barrier  boundary  ring 
by  two  oppositely  directed  copies  of  itself.  Each  tangent  edge  is  replaced  by  two 
oppositely  directed  copies  of  itself,  and  each  copy  is  connected  to  the  ring  consistent 
with  its  direction.  The  procedure  of  duplication  and  connection  is  depicted  in 
Figure  3.20.  Path  graph  edges  from  a  and  u  axe  replaced  by  directed  edges  (outgoing 
from  a,  incoming  to  w)  whose  other  endpoints  are  connected  to  the  appropriate 
directed  rings.  In  this  directed  graph,  which  we  call  Gdi  only  forward  motion  is 
possible.  Every  shortest  path  from  a  to  is  present  in  Gd- 


Figure  3.20.  Construction  of  Gd.  Each  path  graph  edge  is  replaced 
by  two  oppositely  directed  edges;  the  edges  are  linked  up  to  allow  only 
forward-going  paths. 
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To  help  define  the  nodes  of  the  cozdesced  graph,  we  introduce  distinguished 
nodes,  which  are  a  subset  of  the  nodes  of  Gd-  (We  show  how  to  select  this  subset 
later.)  The  distinguished  nodes  have  three  properties  that  we  use  presently.  First, 
all  arc  endpoints  are  distinguished  nodes.  Second,  the  tangents  to  consecutive 
distinguished  nodes  on  a  single  arc  differ  in  direction  by  at  most  ir/2.  Third,  every 
teingent  edge  that  ends  between  two  consecutive  distinguished  nodes  on  a  single  arc 
is  at  least  as  long  as  the  interval  between  the  distinguished  nodes,  measured  along 
the  barrier  boimdary. 

The  distinguished  nodes  break  the  barrier  boundary  rings  into  intervals.  (Note 
that  each  point  on  the  boundary  of  a  barrier  belongs  to  two  intervals:  one  for  each 
direction  of  traversal.)  Somewhat  unconventionally,  we  view  the  distinguished  nodes 
as  single-point  closed  intervals  distinct  from  the  open  intervals  they  separate.  The 
set  of  all  such  intervals  (both  open  and  closed)  forms  the  node  set  of  the  coalesced 
graph,  Gc. 

The  edges  of  the  coalesced  graph  connect  the  intervals  defined  by  the  distin¬ 
guished  nodes.  An  edge  linking  two  intervals  is  assigned  a  weight  (roughly  equiv¬ 
alent  to  length)  as  if  it  went  from  the  forward  point  of  one  interval  (as  defined  by 
the  ring  orientation)  to  the  forward  point  of  the  other. 

We  first  assign  weights  to  tangent  edges.  Each  tangent  edge  of  Gd  links  intervals 
(either  open  or  closed)  delimited  by  distinguished  nodes.  In  Figure  3.21,  the  edge 
from  a  to  5  links  the  intervals  (a',  a")  and  (6',  6").  We  assign  the  corresponding  edge 
in  Gc  the  weight  |a61  —  |a"a|  -|- 166"|,  where  distances  |a"a|  and  |66"|  eire  meaisured 
along  the  barrier  boundary.  This  is  the  net  forward  motion  of  a  hypothetical  car 
that  backs  up  from  a"  to  a  along  the  arc,  goes  forward  along  ab,  and  then  advances 
from  b  to  5"  along  that  arc.  (The  fifth  condition  on  B  means  that  the  length 
computation  takes  0(r)  time.)  Since  there  is  at  most  one  edge  of  Gd  from  one  arc 
to  any  other,  there  is  at  most  one  tangent  edge  connecting  any  pair  of  intervals. 
Because  \oi'oL"\  and  are  no  greater  than  |a6|,  the  edge  linking  the  intervals  of  a 
and  b  in  the  coalesced  graph  has  non-negative  weight. 

We  now  assign  weights  to  boundary  edges.  The  distinguished  nodes  brezdc  each 
barrier  botmdary  into  a  ring  of  intervals.  These  rings  are  a  coarsening  of  the  rings 
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Figure  3.21.  The  edge  of  Gc  corresponding  to  the  directed  edge  from  a 
to  h  is  assigned  weight  |a6|  —  |a"a|  +  |66''| 

of  boundary  edges  in  Gd.  Prom  a  single-point  interval  [o',  a'\  to  its  open  successor 
(o',  a")  we  create  an  edge  with  weight  Ia'a"|.  An  edge  of  weight  0  goes  from  the 
open  interval  (o',  a")  to  its  closed  successor  Note  that  each  edge  joining 

adjacent  intervals  has  weight  equal  to  the  distance  between  the  forward  points  of 
the  intervals. 

If  we  are  to  use  the  coalesced  graph  to  find  shortest  paths,  we  must  show  that 
shortest  paths  in  Gc  correspond  to  shortest  paths  in  the  augmented  path  graph. 

Lemma  3.11.  Any  minimum  weight  path  from  a  to  u)  through  the  coalesced  graph 
corresponds  to  a  shortest  path  of  the  same  length  in  the  augmented  path  graph. 

Proof:  The  proof  has  two  parts.  We  first  show  that  every  path  in  Gd 
maps  to  a  path  in  the  coalesced  graph  with  the  same  length.  Next  we 
show  that  every  path  in  Gc  that  does  not  correspond  to  a  forward-going 
path  has  weight  greater  than  the  length  of  the  shortest  forward-going 
path. 

A  shortest  path  through  Gd  is  also  a  shortest  path  in  the  augmented 
path  graph,  since  a  shortest  path  is  a  forward-going  path.  When  the 
nodes  of  Gd  are  coalesced  into  the  intervals  of  Gc,  the  lengths  of  forward¬ 
going  paths  are  preserved.  The  coalesced  edge  weights  are  calculated  so 
that  when  a  path  travels  forward  along  a  barrier  botmdary,  the  perimeter 
length  is  accounted  correctly.  For  example,  in  Figure  3.22,  the  forward¬ 
going  path  from  s  to  f  has  length  |sa|  -I-  |a61  +  |&/|*  Ini  Gc  the  path  has 
two  edges,  one  from  s  to  the  interval  (a^,  o!*')  and  one  from  the  interval 
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to  /.  These  edges  have  weights  |sa|  +  \aa"\  and  |6/|  —  |a"6|,  which  sum 
to  the  correct  value. 


/ 


Figure  3.22.  Forward-going  paths  are  charged  accurately  in  the  coalesced 
graph.  The  path  from  s  to  f  has  weight  (|sa|  4-  |aa"|)  +  (|6/|  —  |a"6|)> 
which  is  equal  to  |sa|  -I-  |a61  +  |6/|,  the  Euclidean  length  of  the  path. 

Grouping  edge  endpoints  into  intervals  allows  some  paths  in  Gc  that 
axe  not  derived  from  forward-going  paths.  These  are  subpaths  that  enter 
an  interval,  back  up,  and  then  leave,  as  in  Figure  3.23.  The  edge  weights 
of  such  a  subpath  in  Gc  add  up  to  less  than  the  Euclidean  length  of  the 
path.  However,  we  show  that  the  total  weight  of  a  path  in  Ge  from  a 
to  u  that  includes  such  a  subpath  is  still  greater  than  the  length  of  the 
shortest  path  in  Gd- 


Figure  3.23.  Loops  are  undercharged,  but  not  too  much.  The  path 
3  a  b  f  has  weight  |sa|  -|-  |6/|  —  |a6|,  which  is  less  than  its 
length,  but  more  than  the  length  of  the  path  s  —*  x  f .  This  latter 
path  is  longer  than  the  shortest  path  from  s  to  /,  so  the  loop  cannot  be 
mistaken  for  a  shortest  path. 


61 


The  subpath  in  Figure  3.23  has  weight  (lsa|  +  |aa"|)  +  (|6/|  —  |a"6|)  = 
|sa|  4- 16/|  —  |a6|,  which  is  less  than  |sa|  4-  |a6|  4-  \bf\,  the  actual  length 
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of  the  subpath.  However,  because  |5a|  and  \bf\  are  both  greater  than 
|a'a"|  and  the  difference  between  the  tangent  directions  at  b  Eund  a  is  at 
most  7r/2  {Lsxf  >  ^r/2),  the  edges  so  and  bf  intersect  at  x.  Since  Ia6|  is 
measured  along  a  convex  barrier  boundary,  \ax\  +  |®&|  >  |a&|.  Though 
the  weight  of  the  subpath  is  less  than  its  actual  length,  it  is  still  more 
than  |sx|  +  |x/|,  the  length  of  the  subpath  that  follows  lx  to  x  and  then 
follows  HJ.  Because  of  the  open  corner  at  x,  this  second  subpath  cannot 
be  part  of  any  shortest  path. 

The  preceding  analysis  assumes  that  the  loop  x  a  h  x  \s 
isolated  from  any  other  such  loop.  We  now  show  that  the  assumption 
is  valid.  If  an  edge  ba  connects  two  loops  as  in  Figure  3.24(a),  the 
two  loops  axe  independent:  the  subpath  in  Gc  that  contains  them  has 
weight  at  least  |sc|  +  |cd|  +  |df  [,  which  is  the  length  of  an  easily  identified 
subpath.  If  intersections  c  and  d  appeared  in  opposite  order,  the  two 
loops  would  not  be  independent:  the  lower  bound  would  be  |sc|  +  \df  \  — 

\cd\,  which  corresponds  to  no  recognizable  subpath.  However,  because 
Lsca  and  Lfdh  are  at  least  ir/2,  the  interlocking  loops  of  Figure  3.24(b) 
are  impossible. 

Any  path  from  a  to  a;  in  Gc  that  includes  instances  of  doubling  back, 
as  in  Figxire  3.23,  has  weight  greater  than  the  length  of  the  shortest  path 
in  Gd-  Consider  replacing  each  instance  of  doubling  back  by  the  shortcut 
corresponding  to  the  subpath  in  Figure  3.23  that  follows  3x  to  x  and 
then  follows  x/.  The  length  of  the  resulting  barrier-avoiding  path  is  less 
than  the  weight  of  the  path  through  Gc  from  which  it  is  derived.  The 
resulting  path  is  not  a  shortest  path  from  a  to  u;,  since  short-cutting 
the  comer  corresponding  to  Laxf  gives  a  shorter  path.  This  means  that 
no  path  through  Gc  that  is  not  forward-going  can  be  mistaken  for  a 
shortest  path.  | 

The  proof  of  the  preceding  lemma  uses  the  three  properties  of  distinguished 
nodes  mentioned  earlier.  We  now  show  how  to  select  the  distinguished  nodes  so 
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(a)  (b) 

Figure  3.24.  Two  loops  connected  by  a  single  edge;  (a)  possible  and 
(b)  impossible  configurations.  Because  angles  Lsca  and  Lfdh  axe  at 
least  7r/2,  the  points  a,  d,  c,  and  6  occxir  in  that  order  along  ab. 

that  these  properties  hold.  (The  properties,  once  again,  are  the  following:  all 
arc  endpoints  are  distinguished  nodes,  the  tangent  directions  at  consecutive  distin¬ 
guished  nodes  differ  by  at  most  it  12,  and  every  tangent  edge  is  at  least  as  long  as 
the  intervals  containing  its  endpoints.)  There  are  a  number  of  ways  of  selecting 
distinguished  nodes;  the  one  that  we  adopt  here  is  perhaps  the  simplest  nontrivial 
method. 

Otir  approach  characterizes  the  robot  with  two  parameters.  We  take  I  to  be  the 
diameter  of  B,  that  is,  the  maximum  distance  between  two  points  of  B.  This  is 
also  equal  to  the  maximtim  separation  of  two  parallel  supporting  lines  of  B.  The 
minimum  separation  w  of  two  such  lines  is  called  the  width  of  B. 

The  first  distinguished  nodes  we  choose  are  those  path  graph  nodes  that  are 
intersections  of  arcs  or  botmdary  segments  with  each  other,  including  the  junctions 
between  arcs  and  segments  at  their  endpoints.  Kedem  et  al.  (KLPS86]  show  that 
there  are  only  0{n)  such  intersections.  We  add  distinguished  nodes  to  each  arc 
so  that  no  interval  on  an  arc  is  longer  than  1.  We  then  add  distinguished  nodes 
to  guarantee  that  tangent  directions  at  successive  distinguished  nodes  differ  by  at 
most  t/2.  (Note  that  these  two  conditions  on  intervals  need  only  be  met  if  the 
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interval  contains  an  endpoint  of  a  tangent  edge.  If  an  interval  is  too  long  or  turns 
by  more  than  7r/2,  but  has  no  tangent  endpoints  in  it,  there  is  no  need  to  break  it 
in  two  with  a  distinguished  node.  Therefore  all  distinguished  nodes  can  be  taken 
to  be  path  graph  nodes.)  So  fax,  we  have  chosen  0(n)  distinguished  nodes.  Now 
we  add  to  the  set  of  distinguished  nodes  the  endpoints  of  all  tangent  edges  shorter 
than  /.  This  guarantees  the  non-negativity  of  all  edge  weights  in  Gc.  The  following 
lemma  boimds  the  number  of  distinguished  nodes  added  in  the  final  step. 

Lemma  3.12.  Let  w  be  the  width  and  I  the  diameter  of  B.  The  pruned  path  graph 
has  0(nl/w)  edges  shorter  than  1. 

Proof;  The  proof  concentrates  on  the  barriers  connected  to  by  tan¬ 
gent  edges.  It  uses  Lemma  3.10  to  divide  the  barriers  into  four  sets, 
each  of  them  firee  of  overlaps.  For  each  set,  the  subset  connected  to  B’ 
by  edges  shorter  than  I  lies  in  a  relatively  small  region  close  to  B’.  Area 
arguments  show  that  there  are  only  0{lfw)  barriers  in  each  such  subset. 

As  in  Lemma  3.10,  let  the  set  U  contain  the  generators  of  the 
arcs  reached  from  B’  by  tangent  edges  of  a  single  class  in  the  pruned 
graph,  Gp.  The  elements  of  U  are  cyclically  ordered  by  the  angles  of  the 
edges.  At  most  two  pairs  of  consecutive  vertices  in  U  generate  overlap¬ 
ping  barriers.  If  we  remove  from  U  any  vertex  (there  are  at  most  two) 
whose  corresponding  edge  leads  to  a  bay  containing  Of  or  w.  Lemma  3.10 
implies  that  the  barriers  generated  by  the  remaining  vertices  are  pairwise 
disjoint. 

Let  R,  be  the  region  consisting  of  all  points  closer  than  21  to  B*.  The 
area  of  R  is  0(P).  Every  vertex  in  U  corresponding  to  a  tangent  edge 
shorter  than  I  generates  a  barrier  that  is  entirely  contained  in  R.  (See 
Figure  3.25.)  These  barriers  are  non-overlapping,  and  each  has  area 
hence  there  are  at  most  0(1/ w)  of  them.  This  argument  applies 
to  all  four  tangent  edge  classes,  and  so  for  each  polygon  vertex  9,  there 
are  0(1/ w)  edges  in  Gp  tangent  to  B«  and  shorter  than  /.  Summing  over 
all  vertices  proves  the  lemma.  I 
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Figure  3.25.  The  barriers  reached  from  by  tangent  edges  of  a  single 
class  in  the  pruned  path  graph  are  non-overlapping  (with  at  most  two 
exceptions).  Those  connected  to  B’  by  edges  shorter  thsm  I  lie  com¬ 
pletely  inside  the  region  closer  than  21  to  B«  (the  shaded  region).  This 
region  has  area  0{P),  and  each  barrier  has  area  so  the  number 

of  short  edges  is  0(l/w). 
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Our  algorithm  finds  shortest  paths  by  producing  a  graph  on  which  Dijkstra’s 
single-source  shortest  path  algorithm  will  run  quickly,  then  invoking  that  algorithm. 
Sections  .3.4  through  3.6  show  how  to  construct  the  coalesced  graph  in  O(rn^) 
time.  Dijkstra’s  algorithm  takes  0(n^  -h  min(n, //u?)nlogn)  to  find  the  shortest 
path  in  Gc’,  this  shortest  path  corresponds  directly  to  the  shortest  path  in  the 
original  setting.  (The  min  function  is  present  because  there  axe  at  most  O(n^) 
tangent  edges  shorter  than  /  in  the  path  graph.)  Overall,  our  algorithm  uses  time 
0(rn^  +  xoin{n,  I /w)n  log  n)  and  space  O(n^).  If  J?  is  independent  of  n,  as  in  the 
important  special  case  in  which  B  is  a  disk,  the  algorithm  finds  shortest  paths  in 
0(n^)  time  and  space.  If  B  depends  on  n  but  the  aspect  ratio  l/w  is  0(n/ log  n), 
the  nmning  time  of  the  algorithm  is  still  O(rn^).  This  concludes  the  proof  of  the 
following  theorem: 

Theorem  3.3.  Given  a  non-rotating  convex  body  A,  a  set  S  of  simple  polygons 
with  a  total  of  n  vertices,  and  two  points  a  and  u,  it  is  possible  to  £nd  a  shortest 
polygon-avoiding  path  for  A  from  a  to  uf  in  0(n^)  time,  where  the  implied  constant 
depends  on  the  complexity  of  A. 


3.7  Conclusions  and  open  questions 

In  this  chapter  we  have  presented  an  algorithm  to  find  shortest  paths  for  a  non¬ 
rotating  convex  body  moving  among  polygonal  obstacles.  We  first  reduced  the 
problem  to  the  equivalent  problem  of  moving  a  point  among  “fattened”  obstacles 
[LW79,GRS83].  We  then  introduced  the  concept  of  tangent- visibility  to  describe  the 
sequence  of  barrier  botmdary  points  swept  by  a  tangent  ray  continuously  rotating 
aroimd  a  placement  of  B,  the  inverted  robot.  This  concept  allowed  us  to  compute 
efficiently  the  path  graph  of  the  barriers,  which  contains  the  edges  of  all  shortest 
paths.  Unfortunately  the  path  graph  may  cont2dn  f2(n^)  nodes,  so  that  Dijkstra’s 
shortest  path  algorithm  runs  in  log  n)  time  when  applied  to  it.  By  pruning 
certain  path  graph  edges  and  then  coalescing  the  remaining  nodes  into  distinguished 
nodes,  we  were  able  to  reduce  the  total  number  of  nodes  to  0(n)  and  thus  get  a 
shortest  path  algorithm  that  rims  in  O(n^)  time. 
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Perhaps  the  least  satisfying  feature  of  this  algorithm  is  its  dependence  on  the 
aspect  ratio  l/w  of  the  object  to  be  moved.  According  to  our  bounds,  the  algorithm 
could  require  fi(n^logn)  time  to  find  shortest  paths  for  an  ellipse  whose  aspect 
ratio  is  a  linear  function  of  n.  This  is  disturbing,  since  in  the  limit  such  an  ellipse 
becomes  a  segment,  for  which  shortest  paths  are  easy  to  find. 

Our  algorithm  uses  0(nl/w)  distinguished  nodes  in  the  construction  of  Ge-  How¬ 
ever,  this  upper  bound  on  the  number  of  necessary  distinguished  nodes  is  very 
crude.  If  we  omit  inner  common  tangents  from  the  path  graph,  we  can  show  that 
only  0(n  log(//ttf))  distinguished  nodes  are  needed  to  construct  Ge-  We  have  not 
yet  been  able  to  extend  this  botmd  to  allow  inner  common  tangents,  though  we 
believe  such  an  extension  is  possible. 

Both  upper  boimds  mentioned  are  local  bounds:  we  prove  that  no  fattened  poly¬ 
gon  vertex  needs  to  have  more  than  0(1 /w)  or  O(log(//u;))  distinguished  vertices 
on  its  boundary.  Such  botmds  fail  to  consider  the  interactions  of  multiple  barriers. 
It  seems  quite  possible  that  a  global  argument  could  boimd  the  number  of  necessary 
distinguished  nodes  by  0(n^/logn)  or  something  even  smaller. 

The  gap  between  upper  and  lower  boimds  is  large;  we  can  construct  an  example 
using  only  outer  common  tangents  in  which  Q(y\og{l / w))  distinguished  nodes  are 
needed  on  a  particular  fattened  vertex,  but  we  have  no  super-linear  global  lower 
botmds.  Improvements  to  either  our  lower  or  upper  bounds  would  be  welcome. 

The  algorithm  we  have  presented  does  not  process  5  before  a  and  u)  are  given. 
However,  it  might  be  possible  to  construct  approximations  to  the  whole  progression 
of  graphs  from  G  to  Ge  before  a  and  a;  are  given,  then  modify  them  in  0(rn) 
time  once  the  source  and  destination  are  known.  The  most  difficult  preprocessing 
task  would  be  pruning  edges  and  simultaneously  building  a  structure  to  identify 
the  pruned  edges  that  must  be  added  back  once  a  and  a;  are  known.  This  is  an 
interesting  problem,  but  it  is  probably  not  worth  working  out  the  details  so  long 
as  the  final  step  of  the  algorithm  has  to  run  Dijkstra’s  algorithm,  which  may  take 
fl(n^)  time. 


Chapter  4 

Linear  Time  Algorithms  for 
Visibility  and  Shortest  Path 
Problems  Inside  Triangulated 
Simple  Polygons 


The  essence  of  all  good  strategy  is  simplicity. 
—  Eric  Ambler,  Journey  into  Fear  (1940) 

Nothing  is  as  simple  as  it  ought  to  be. 

—  John  Hershberger  (1987) 


The  preceding  two  chapters  consider  visibility  and  shortest  path  problems  in 
which  the  obstacles  are  disjoint  line  segments  in  the  plane.  Because  they  do  not  take 
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advantage  of  special  relationships  among  the  segments,  the  algorithms  of  Chapters 
2  and  3  run  in  0(n^)  time.  Algorithms  that  exploit  the  geometry  of  the  obstacles 
can  run  faster  than  O(n^),  as  Reif  and  Storer  show  [RS85].  Bearing  this  in  mind, 
we  focus  our  attention  on  problems  in  which  the  segments  form  a  simple  polygon. 
We  obtain  algorithms  that  are  optimal  after  the  polygon  has  been  triangulated. 

All  of  the  algorithms  in  the  next  three  chapters  rtm  in  triangulated  simple  poly¬ 
gons.  (To  triangulate  a  polygon  with  n  vertices,  we  add  n  —  3  non-intersecting 
diagonals  to  the  polygon  interior,  splitting  it  up  into  n  —  2  disjoint  triangles.)  A 
recent  algorithm  of  Tarjan  and  Van  Wyk  [TV86]  for  triangulating  simple  polygons 
runs  in  0(n  log  log  n)  time,  thereby  improving  the  previous  0(n  log  n)  algorithm  of 
Garey  et  al.  [GJPT78].  Although  this  restilt  falls  short  of  the  goal  of  achieving  a 
lineeir  time  boimd,  it  shows  that  triangulating  simple  polygons  is  a  simpler  problem 
than  sorting,  and  raises  the  hope  that  linear-time  triangulation  might  be  possible. 
The  result  thus  renews  interest  in  linear-time  algorithms  on  already-triangulated 
polygons,  a  considerable  number  of  which  have  been  recently  developed  (for  lists  of 
these  see  [FM84,TV86]).  Our  algorithms  fall  into  this  class. 

In  this  chapter  we  present  algorithms  that  solve  the  following  problems  in  linear 
time,  given  a  trieingulated  simple  polygon  P  with  n  sides: 

(1)  Given  a  fixed  source  point  x  inside  P,  calcvdate  the  shortest  paths  inside  P 
from  X  to  all  vertices  of  P.  (Omr  algorithm  even  provides  a  linear- time  pro¬ 
cessing  of  P  into  a  data  structure  from  which  the  length  of  the  shortest  path 
inside  P  from  x  to  any  desired  target  point  y  can  be  found  in  time  0(log  n); 
the  path  itself  can  be  found  in  time  O(logn  +  k),  where  k  is  the  number  of 
segments  along  the  path). 

(2)  Given  a  fixed  edge  c  of  P,  calculate  the  subpolygon  Vis{P,  e)  consisting  of  all 
points  in  P  visible  from  (some  point  on)  c. 

(3)  Given  a  fixed  edge  c  of  P,  preprocess  P  so  that,  given  any  query  ray  r  em¬ 
anating  from  e  into  P,  the  first  point  on  the  boundary  of  P  hit  by  r  can  be 
found  in  O(log  n)  time. 
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(4)  Given  a  fixed  edge  e  of  P,  preprocess  P  so  that,  given  any  point  x  inside  P, 
the  subsegment  of  c  visible  from  x  can  be  computed  in  O(log  n)  time. 

(5)  Given  a  vertex  x  of  P  lying  on  its  convex  hull,  calculate  for  all  other  vertices  y 
of  P  the  clockwise  and  counterclockwise  convex  ropes  aroimd  P  from  x  to  y, 
when  such  paths  exist.  (These  are  polygonal  paths  in  the  exterior  of  P  firom  x 
to  y  that  wrap  around  P,  always  turning  in  a  clockwise  (resp.  counterclock¬ 
wise)  direction;  see  Section  4.1.1.) 

Our  results  improve  previous  algorithms  for  some  of  these  problems  (refer  to 
[Cha82,CG85,PS85a]).  All  of  the  algorithms  in  this  chapter  are  based  on  the  so¬ 
lution  to  Problem  (1)  and  exploit  interesting  relationships  between  visibility  and 
shortest  path  problems  for  a  simple  polygon.  Our  algorithm  for  solving  Problem  (1) 
extends  the  technique  of  Lee  and  Preparata  [LP84]  for  calculating  the  shortest  path 
inside  P  between  a  single  pair  of  points.  To  obtain  linear-time  performance,  it  uses 
finger  search  trees,  a  data  structure  for  efficient  access  into  an  ordered  list  when 
there  is  locality  of  reference  (see  [GMPR77,HM82]). 

The  chapter  contmns  three  main  sections.  Section  4.1  presents  a  linear-time 
solution  to  the  shortest  path  Problem  (1),  constructing  all  shortest  paths  from  a 
source  to  the  polygon  vertices.  We  solve  Problem  (5)  as  an  easy  application  of  this 
result.  Section  4.2  extends  the  result  of  Section  4.1  to  answer  shortest  path  queries, 
as  specified  in  the  second  part  of  (1).  This  extension  is  important  in  Chapter  5, 
so  Section  4.2  characterizes  it  in  some  detail.  Section  4.3  concludes  the  chapter  by 
solving  the  visibility  problems  (2)-(4). 
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4.1  Calculating  a  shortest  path  tree  for  a  simple 
polygon 


Two  roads  diverged  in  a  wood,  and  I — 
I  took  the  one  less  travelled  by 
And  that  has  made  all  the  difference. 

—  Robert  Frost,  “The  Road  Not  Taken”  (1916) 


Let  P  be  a  (triangulated)  simple  polygon  having  n  vertices,  and  let  s  be  a  given 
source  vertex  of  P.  (Our  algorithm  will  also  apply,  with  some  minor  modifications, 
to  the  case  in  which  s  is  an  arbitrary  point  in  the  interior  or  on  the  boundary  of  P. 
For  the  sake  of  exposition,  the  algorithm  below  is  described  for  the  case  in  which  s 
is  a  vertex  of  P,  and  we  later  comment  on  the  modifications  required  to  handle 
the  case  of  an  arbitrary  source  s.)  For  any  two  points  p  and  q  inside  P  or  on  its 
boundary,  we  denote  by  ir(p,  g)  the  Euclidean  shortest  path  from  p  to  g  that  does 
not  go  outside  P.  It  is  well  known  (see  [LP84])  that  7r(p,  q)  is  a  unique  polygonal 
path  whose  corners  are  vertices  of  P.  Furthermore,  the  union  Uv7r(s,v),  taken  over 
all  vertices  v  of  P,  is  a  plane  tree  rooted  at  s.  We  call  this  tree  the  shortest  path 
tree  of  P  with  respect  to  s  and  denote  it  by  SPT(P,s).  (See  Figtire  4.1.)  This 
tree  has  n  nodes,  namely  the  vertices  of  P,  and  its  edges  eire  straight  segments 
connecting  these  nodes.  Our  goal  is  to  calculate  this  tree  in  linear  time,  once  a 
triangulation  of  P  is  given. 

Let  <?  be  a  triangulation  of  the  interior  of  P.  The  planar  dual  of  G  (whose 
vertices  are  the  triangles  in  G  and  whose  edges  join  two  such  triangles  if  they 
share  an  edge)  is  a  tree  T,  each  of  whose  vertices  has  degree  at  most  three.  The 
counterclockwise  order  of  the  edges  incident  to  a  vertex  of  T  with  degree  three  is 
determined  by  the  triangulation.  For  an  example  of  a  triangulation  and  its  dual 
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Figure  4.1.  The  shortest  path  tree  is  the  union  of  the  shortest  paths 
from  a  source  vertex  s  to  all  other  vertices. 

tree,  see  Figure  4.2.  If  a  vertex  i  of  P  is  incident  to  k  diagonals,  then  it  is  incident 
to  Jk  +  1  triangles.  We  call  the  nodes  of  T  that  correspond  to  these  triangles  the 
neighbors  of  t,  written  N(t).  For  each  vertex  t  of  P,  there  is  a  unique  minimal  path 
in  T  connecting  a  node  in  N(3)  to  a  node  in  N(t).  The  edges  of  this  path  correspond 
to  an  ordered  sequence  of  diagonals  di,d2, ...  ,di  of  P.  (Choosing  the  minimal  path 
between  N(s)  and  N(t)  ensures  that  the  sequence  di,...,di  contains  no  diagonal 
with  5  or  t  as  an  endpoint.)  The  diagonals  d,  are  exactly  those  that  divide  P 
into  two  parts,  each  containing  one  of  s  and  t.  Because  any  path  that  crosses  a 
diagonal  twice  can  be  shortened  by  removing  both  crossings,  r(s,t)  intersects  only 
the  diagonals  d,  ,  1  <  i  <  /,  and  each  of  them  exactly  once. 

Let  d  =  uw  be  a  diagonal  or  an  edge  of  P  and  let  a  be  the  nearest  common 
ancestor  of  u  and  w  in  the  shortest  path  tree  SPT(P,s).  It  is  shown  in  [LP84] 
that  7r(a,u)  and  7r(a,w)  are  both  inward  convex;  that  is,  the  convex  hull  of  each 
of  these  subpaths  lies  outside  the  open  region  bounded  by  7r(a,u),  7r(a,w),  and 
the  segment  uw.  This  implies  that  along  each  of  7r(<3,  u)  and  ir(a,w)  the  slopes 
of  the  sides  change  in  a  monotonic  fashion.  Following  [LP84],  we  call  the  union 
F  =  Fssr  =  7r(a,  u)  U  7r(a,  w)  the  funnel  associated  with  d  =  uw;  we  call  a  the 
apex  of  the  funnel  and  uw  the  base  of  the  funnel.  See  Figure  4.3  for  an  example 
of  a  funnel.  Note  that  the  apex  of  the  funnel  may  be  s,  and  that  the  funnel  can 
degenerate  into  a  single  segment:  if  a  =  it,  then  ir(a,  u?)  =  uw. 

Suppose  next  that  d  is  a  diagonal  of  P  used  by  G,  and  let  ^uwx  be  the  unique 
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Figure  4.2.  A  triangulation  and  its  dual.  The  shaded  tree  T  inside 
the  polygon  is  the  dual  of  the  triangulation.  T  has  n  —  2  nodes  and 
n  —  3  edges;  each  edge  corresponds  to  a  diagonal  of  the  triangulation. 
Vertex  s  is  incident  to  three  triangles,  vertex  t  to  only  one.  These  two 
sets  of  triangles  correspond  to  two  sets  of  nodes.  The  shortest  path  in  T 
between  the  two  sets  connects  the  circled  nodes  of  T;  the  edges  on  this 
path  correspond  to  the  diagonals  crossed  by  7r(s,  t). 


Figure  4.3.  The  two  shortest  paths  from  s  to  u  and  to  define  the  funnel 
belonging  to  edge  mo.  The  vertex  a  at  which  the  paths  separate  is  the 
apex  of  the  funnel,  and  the  funnel  itself  is  the  imion  of  the  two  shortest 
paths  7r(a, «)  and  ir(a,  w). 
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triangle  in  G  having  d  as  an  edge  that  does  not  intersect  the  area  bounded  between 
F  and  d.  Then  the  shortest  path  from  s  to  a:  must  start  with  7r(s,a)  and  then 
either  continue  along  the  straight  segment  ax  if  this  segment  does  not  intersect  F, 
or  else  proceed  along  either  7r(a,  u)  or  7r(a,  w)  to  a  vertex  v  such  that  ux  is  a  tangent 
to  F  at  u,  and  then  continue  along  the  straight  segment  vx  (see  Figure  4.4).  The 
tangent  vertex  is  the  unique  funnel  vertex  v  where  the  slope  of  the  funnel  zind  the 
slope  of  vx  are  equal.  These  observations  form  the  basis  for  the  algorithm  of  Lee 
and  Preparata  [LP84],  and  for  ours. 


Figure  4.4.  Splitting  a  funnel 

We  axe  now  ready  to  describe  oxir  algorithm,  which  is  essentially  a  depth-first 
search  exploration  of  the  triangulation,  starting  with  a  triangle  incident  to  the 
source.  Each  step  processes  an  tmexplored  triangle  that  shares  a  diagonal  with  one 
already  explored.  The  algorithm  maintains  a  shortest  path  tree  in  the  subpolygon 
explored  so  far.  It  also  maintains  a  funnel  for  each  diagonal  at  the  bormdary  of  the 
explored  region.  When  the  algorithm  crosses  into  a  new  triangle,  it  splits  the  funnel 
belonging  to  the  crossed  diagonal  into  two  funnels,  one  for  each  of  the  triangle’s 
other  two  sides.  The  splitting  edge  is  the  tangent  from  the  triangle’s  third  vertex 
to  the  interior  of  the  funnel.  When  one  of  the  new  sides  is  a  polygon  edge  rather 
than  a  diagonal,  the  algorithm  has  found  a  funnel  of  the  final  shortest  path  tree. 

As  the  algorithm  comes  to  process  a  diagonal  d  =  uw  of  P,  it  maintains  the 
current  funnel  F  =  F^  as  a  sorted  list  [«/, u/_i, . . . , «i, a,wi,.. . , twfc],  where  a  = 
uq  =  Wo  is  the  apex  of  F,  ir(a,u)  =  [uo5---,w/]»  7r(a,ti>)  =  [rt>o, . . . , tyjt]  (either  of 
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these  sublists  can  be  empty  except  for  a),  and  u/  =  u,Wk  =  w.  In  the  algorithm, 
we  denote  the  apex  a  of  the  present  funnel  F  by  APEX (F). 

The  algorithm  begins  by  placing  s  and  an  adjacent  vertex  vi  in  F  and  setting 
APEX{F)  =  s.  It  then  proceeds  recursively  as  follows. 

ALGORITHM  PATH(F) 

Let  u  and  w  be  the  first  and  the  last  elements  of  F,  and  let  a  =  APEX(F)  (thus 
F  =  7r(a,  u)  U  7r(a,  w)).  Let  Auwx  be  the  tinique  triangle  in  the  triangulation  G 
of  P  that  has  uw  as  an  edge  and  that  has  not  yet  been  processed  (see  Figure  4.4). 

(a)  Search  F  for  an  element  v  for  which  is  a  tangent  to  F  at  v  (if  the  straight 
line  segment  ox  does  not  intersect  F  then  v  =  a).  The  search  is  very  simi¬ 
lar  to  that  for  finding  a  tangent  to  a  convex  polygon  from  an  exterior  point 
(as  described  in  [PS85b,  page  115]).  If  we  move  a  point  v*  from  one  end  of 
the  funnel  to  the  other,  the  difference  between  the  slope  of  the  funnel  at  v” 
and  the  slope  of  v*x  varies  monotonically  (we  must  choose  the  reference  di¬ 
rection  so  the  funnel  slope  is  a  continuous  function  of  u*).  We  use  binary 
search  to  find  the  zero  point  of  the  difference:  each  (unsuccessful)  “compari¬ 
son”  performed  at  some  node  v*  during  this  search  determines  a  unique  side 
of  V*  on  which  the  desired  v  lies.  We  split  F  U  {x}  into  two  new  funnels 
Fi  =  [«,..., v,x]  and  F2  =  [x,u,. . . If  u  belongs  to  it{a,u)  then  we  set 
APEX{Fi)  :=  u,  APEX(F2)  :=  a.  If,  on  the  other  hand,  v  belongs  to  7r(a,w) 
then  APEX(Fi)  :=  a,  APEX(F2)  :=  u. 

(b)  Set  7r(s,  x)  :=  7r(3,  v)  U  ux.  (Actually  we  just  store  a  back  pointer  from  x  to  v. 
The  collection  of  all  these  pointers  will  constitute  the  required  shortest  path 
tree  SPT{P,s).) 

(c)  If  the  line  segment  ux  is  a  diagonal  of  P  then  we  call  PATH(Fi)  recursively. 

(d)  If  the  line  segment  wx  is  a  diagonal  of  P  then  we  call  PATH(F2)  recursively. 

The  list  representing  the  present  funnel  F  is  stored  in  a  finger  search  tree 
(see  [GMPR77,HM82,TV86]).  This  structure  is  essentially  a  search  tree  equipped 
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with  fingers  (which,  in  our  application,  axe  always  placed  at  the  first  node  and  at  the 
last  node  of  the  tree  in  symmetric  order).  To  facilitate  the  search  for  the  vertex  v 
at  which  the  tangent  from  x  touches  F,  we  store  with  each  item  v  of  F  appearing  in 
the  search  tree  two  pointers,  to  its  predecessor  and  successor  in  F.  This  enables  us 
to  calculate  the  slopes  of  the  two  edges  of  F  incident  to  v  in  constant  time,  and  by 
comparing  these  slopes  with  that  of  ux  we  can  tell  in  constant  time  on  which  side 
of  V  the  binary  search  should  continue.  This  structure  therefore  supports  searching 
for  a  tangent  from  a  point  x  in  time  O(log^),  where  6  is  the  distance  from  the  point 
of  tangency  to  the  nearest  finger;  it  also  supports  operations  that  split  the  tree  into 
two  subtrees  at  an  item  v  in  amortized  time  0(log5),  with  S  as  above  [TV86]. 

The  correctness  of  our  algorithm  is  a  direct  consequence  of  the  correctness  of 
the  algorithm  of  Lee  and  Preparata  [LP84]. 

The  main  difference  between  our  algorithm  and  that  of  Lee  and  Preparata  is  in 
the  techniques  for  representation,  searching,  and  splitting  of  frmnels.  In  [LP84]  the 
search  for  the  vertex  v  is  a  linear  search  starting  at  one  designated  endpoint  of  F. 
This  is  sufficient  to  guarantee  the  linearity  of  their  procedure  since  in  their  case  the 
vertices  of  F  that  are  scanned  during  the  search  for  v  are  no  longer  required,  as 
the  algorithm  always  continues  with  only  one  of  the  funnels  Fi  or  F2  (depending 
on  whether  the  next  diagonal  to  be  crossed  is  xw  or  xu).  In  our  case,  however,  the 
algorithm  may  have  to  continue  recursively  with  both  funnels,  and  thus  it  requires 
a  funnel  seeirching  and  splitting  strategy  that  uses  finger  search  trees  (and  is  thus 
subtler  than  the  simple  linear  list  representation  used  in  [LP84]),  in  order  to  obtain 
the  desired  linear  time  complexity. 

To  bound  the  time  required  by  the  algorithm  we  argue  as  follows.  Let  T  be 
the  dual  tree  of  the  triangulation  of  P.  It  is  easily  seen  that  T  has  n  —  2  nodes. 
Without  loss  of  generality,  suppose  s  lies  in  just  one  triangle  tq  of  T,  which  we  take 
to  be  the  root  of  T.  (If  s  is  a  vertex  of  P  that  lies  in  several  triangles  of  the  given 
triangulation,  then  at  least  one  of  them  will  be  bounded  by  an  edge  of  P  incident 
to  s,  and  we  can  start  the  algorithm  from  that  triangle.  The  algorithm  will  then 
correctly  propagate  the  funnel  structure  to  all  the  other  triangles  containing  3;  the 
funnels  for  the  triangle  edges  containing  s  are  all  trivial.)  Thus  eaeh  node  of  T 


CHAPTER  4.  THE  SHORTEST  PATH  TREE 


77 


(including  the  root)  has  0,  1,  or  2  children.  Our  algorithm  is  a  depth-first  traversal 
of  T.  With  each  node  ^  of  T  we  associate  the  parameter  m^,  denoting  the  size 
(number  of  edges)  of  the  funnel  F  just  before  C  is  processed.  For  example,  =  1. 

When  our  algorithm  processes  the  node  ^  of  T,  it  splits  its  funnel  into  two  parts 
and  then  appends  a  new  edge  to  both  parts  to  form  the  funnels  of  the  children 
Cl  >  C2  of  C  in  T.  If  the  split  parts  of  the  funnel  of  C  contain  mi  and  mj  =  — 

mi  edges  respectively,  then  m^,  =  mi  -|-  l,m^,  =  m2  +  1,  and  the  (amortized) 
cost  of  processing  C  using  finger  search  trees  is  K(0  =  O(min(logm(,,logm(j)). 
The  complexity  of  our  algorithm  essentially  depends  only  on  the  growth  of  the 
function  m^  over  the  nodes  ^  €T.  If  a  node  C  has  just  one  child,  then  m^t  is  at 
most  m^  -1-1;  if  C  has  two  children,  then  m^  is  split  into  two  parts,  and  each  child 
inherits  one  part  plus  one. 

We  begin  our  analysis  with  the  following  observation:  at  nodes  C  G  ^  that  have 
just  one  child  or  are  leaves,  the  “direct  costs”  sum  up  to  at  most  0(n).  Indeed, 
suppose  C  G  T  has  just  one  child  C'?  ^d  that  the  funnel  at  C  has  been  split  into 
two  parts  having  m'  and  m^  —  m'  edges  respectively.  Then  the  vertices  of  P  lying 
in  one  of  these  parts  will  never  be  encountered  again  by  the  algorithm  (by  the  same 
reasoning  used  in  [LP84]  to  justify  the  linearity  of  their  procedure).  The  number 
of  such  vertices  is  at  least  min(m',m(  —  m')  —  1  >  K{Q  —  2.  Thus  the  sum  of  all 
these  iir(C)’s  is  proportioned  to  at  most  n  -f-  2|r|  <  3n,  as  claimed. 

Next  consider  the  total  direct  costs  at  nodes  having  two  children.  We  claim 
that  it  is  suflBcient  to  consider  only  cases  in  which  m^  grows  by  exactly  one  at  eeich 
node  C  of  T  having  a  single  child,  because  these  cases  provide  maximum  growth  of 
the  function  m  down  the  tree  T.  Under  this  additional  assumption,  we  have  the 
following  lemma: 

Lemma  4.1.  For  a  node  C  G  T,  let  the  leaves  of  the  subtree  T(  ofT  rooted  at  C  be 
and  set  M(  =  T!J=i 

=  m^  +  ITcl, 


where  [TJ  is  the  number  of  edges  in  T(. 


CHAPTER  4.  THE  SHORTEST  PATH  TREE 


78 


Proof:  The  lemma  follows  from  summing  the  equations 

t 

2  =  m(  +  t 

i=i 


for  each  internal  node  ^  of  where  are  the  children  of  ^  (<  =  1 

or  2).  I 


Corollary  4.1.  in  particular,  we  have  <  M^. 


For  each  C  G  ^  let  C(C)  denote  the  total  cost  of  processing  nodes  with  two 
children  in  the  subtree  of  T  rooted  at  Then 


{0  if  C  is  a  leaf 

C(0  if  C  lias  just  one  child  C 

C(Ci)  +  €((2)  +  O(min(log , log  if  C  has  two  children  Ci,  Ca¬ 

in  solving  these  recurrence  formulas,  we  can  zissume  without  loss  of  generality  that 
each  node  in  T  is  either  a  leaf  or  has  two  children.  Moreover,  replacing  by 
(using  the  preceding  corollary)  in  these  formulas,  we  obtzdn  the  recurrence  formula 


^(0  = 


I 


0 

C'(Ci)  +  €((2)  +  0(min(logM^,,logMc3)) 


if  C  is  a  leaf 

if  C  bas  two  children  CiiCa- 


But  M(  =  Mfj  +  M^j,  if  C  bzis  children  CijCa?  and  M(  >  1  for  all  nodes  C-  Hence  if 
C*{k)  is  the  maximum  cost  C(C)  for  any  node  C  with  M(  =  k,  then  we  obtain  the 
formula 


C*(m)  =  max  {C*(A;)  +  C*(m  -  k)  +  O(min(log  k, log(m  -  A:)))}, 

l<A:<in— 1 

whose  solution  is  C*{m)  =  0{m)  (see  [Meh84,  page  185]).  Finally,  by  Lemma  4.1 
we  have  for  the  root  tq  of  T 

Mro=  mr,  +  |r|  =  n  -  2. 

Thus  the  total  complexity  of  the  algorithm  is 

0(n)  +  0(M,J  =  0(n). 

Summarizing  our  analysis,  we  obtain  the  following  theorem: 
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Theorem  4.1.  The  shortest  paths  inside  a  triangulated  simple  polygon  P  from  a 
£xed  source  vertex  to  all  the  other  vertices  of  P  can  be  calculated  in  linear  time. 

Remark:  As  finger  trees  are  complicated  to  implement,  we  could  have  obtained  a 
simpler  but  less  efficient  version  of  the  algorithm  by  maintaining  funnels  simply  as 
doubly-linked  linear  lists  (the  same  data  structure  as  that  used  in  [LP84]),  and  by 
performing  each  search  through  a  funnel  in  a  linear  maimer,  starting  simultaneously 
from  both  ends  of  the  funnel.  The  complexity  analysis  of  this  modified  procedure 
is  almost  identical  to  that  given  above,  except  that  the  direct  costs  at  each  triangle 
processed  are  now  linear,  rather  than  logarithmic,  in  the  smaller  of  the  subfunnel 
sizes.  This  leads  to  a  recurrence  formula  for  C*  of  the  form 

C*{m)  =  max  {C*(k)  +  C*{rn  —  k)  +  0(min(fc,  m  —  fc))), 

whose  solution  is  C*(n)  =  O(nlogn)  (cf.  [GK82,  pp.  25-27]  or  [Knu72]). 

Remark:  Another  possibility  is  to  represent  the  funnels  by  self-adjusting  search 
trees  [ST85].  A  weak  form  of  the  dynamic  optimality  conjecture  of  Sleator  and 
Tarjan  for  such  trees  (see  [ST85])  suggests  that  the  resulting  shortest  path  algorithm 
runs  in  linear  time,  although  we  do  not  know  how  to  prove  this. 

Remark:  If  the  source  s  is  not  a  vertex  of  P,  we  can  modify  the  algorithm  as  follows. 
Suppose  3  is  internal  to  a  single  triangle  A  =  Auvw  of  G.  Then  we  split  A  into 
three  subtrieingles  Asuv,  Asvw,  Aswu,  each  having  a  as  a  vertex,  and  repeat  the 
algorithm  three  times,  each  time  starting  at  one  of  these  triangles  and  propagating 
the  funnel  structure  only  through  the  edge  of  that  triangle  which  is  also  an  edge 
of  A.  Similar  problem  splitting  can  be  employed  when  s  lies  on  an  edge  of  one  or 
two  of  the  triangles  in  G.  It  is  easily  checked  that  the  modified  edgorithm  produces 
the  desired  shortest  path  tree  in  linear  time. 
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4.1.1  The  convex  rope  algorithm 

As  an  immediate  and  relatively  simple  application  of  the  shortest  path  algorithm 
just  presented,  we  consider  the  convex  rope  problem,  posed  by  Peshkin  and  Sander¬ 
son  [PS85a]:  Let  P  be  a  simple  polygon,  let  s  be  a  vertex  of  P  lying  on  the  convex 
hull  of  P,  and  let  v  be  another  vertex  of  P.  The  clockwise  convex  rope  from  s  to  u 
is  the  shortest  polygonal  path  (s  =  po, ...  ,pm  —  v)  starting  at  s  and  ending  at  v 
that  does  not  enter  the  interior  of  P  and  that  is  clockwise  convex,  in  the  sense  that 
the  clockwise  angle  from  the  directed  segment  pi-ipi  to  the  directed  segment  PiPi+i 
is  less  than  it,  for  i  =  1, . . . ,  m  —  1.  The  counterclockwise  convex  rope  from  s  to  u 
is  defined  in  a  symmetric  manner  (see  Figure  4.5).  Not  all  the  vertices  of  P  need 
have  convex  ropes  from  s.  The  vertices  v  that  have  both  clockwise  and  coimter- 
clockwise  ropes  from  any  hull  vertex  s  are  precisely  those  that  are  “visible  from 
infinity”  (see  [PS85a]);  calculation  of  these  convex  ropes  is  required  in  [PS85a]  to 
plan  reachable  grasps  of  P  by  a  simple  robot  arm.  In  [PS85a],  an  O(n^)  algorithm 
is  presented.  Using  the  shortest  path  algorithm  given  above,  we  obtain  an  improved 
algorithm  nmning  in  linear  time  plus  the  time  for  triangulation. 

The  convex  rope  problem  can  be  solved  as  follows.  First  compute  the  convex  hull 
of  P  in  linear  time  (cf.  [PS85a,GY83]).  The  clockwise  (respectively  counterclock¬ 
wise)  convex  rope  from  s  to  any  vertex  v  on  the  convex  hull  can  now  be  calculated 
by  moving  along  the  convex  hull  in  a  clockwise  (respectively  counterclockwise)  di¬ 
rection  from  s  to  V.  For  each  vertex  v  not  on  the  convex  hull,  v  lies  inside  a  simple 
polygon  Q  (a.  “bay”  of  P)  bounded  by  some  subsequence  of  the  sides  of  P  and  by 
an  edge  of  the  convex  hull  that  is  not  a  side  of  P  (see  Figure  4.5;  note  also  that  the 
collection  of  all  these  bays  can  be  found  in  linear  time). 

Let  Vi ,  V2  be  the  endpoints  of  this  edge  such  that  vi  is  reached  first  from  s  when 
moving  along  the  convex  hull  in  a  clockwise  direction.  The  clockwise  (respectively 
counterclockwise)  convex  rope  from  s  to  u  is  then  the  clockwise  convex  rope  from  s 
to  Vi  (respectively  the  counterclockwise  convex  rope  from  s  to  V2)  followed  by  the 
shortest  path  from  vi  to  v  (respectively  from  V2  to  u)  within  Q,  provided  that  this 
shortest  path  is  clockwise  (respectively  counterclockwise)  convex.  (Otherwise  the 
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Figure  4.5.  The  convex  rope  problem 

required  convex  rope  does  not  exist.)  Hence  we  can  use  the  shortest  path  tree 
algorithm  presented  above  to  calculate  the  shortest  paths  from  Vi  and  from  to  all 
the  vertices  of  Q  (and  also  to  check  whether  these  paths  are  convex  in  the  required 
directions)  in  time  0(|Q|  log  log  |Q|).  Since  the  siun  of  the  sizes  of  all  the  “bays”  Q 
of  P  is  0(n),  it  follows  that  we  can  solve  the  convex  rope  problem  in  O(nloglogn) 
time. 

4.2  The  shortest  path  map 

This  section  defines  the  shortest  path  map,  which  is  a  representation  of  all  shortest 
paths  from  a  source  vertex  s  to  points  inside  P.  The  shortest  path  map  partitions 
the  interior  of  P  into  disjoint  triangular  regions,  each  with  a  distinguished  vertex 
called  the  apex.  The  regions  are  chosen  such  that  the  shortest  path  from  s  to  a 
point  V  passes  through  the  apex  of  the  region  containing  v. 

We  produce  the  shortest  path  map  by  partitioning  the  funnels  defined  in  Sec¬ 
tion  4.1.  The  following  lemma  proves  some  properties  of  funnels  necessary  for  the 
construction. 

Lemma  4.2.  For  each  edge  e  of  P,  let  $(e)  denote  the  region  bounded  by  e  and 
by  the  funnel  Fg.  Then 
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(a)  Let  X  be  a  point  inside  such  a  region  $(e).  Let  v  be  a  vertex  in  the  cor- 
.  responding  funnel  such  that  vx  is  tangent  to  the  funnel  (in  the  terminology 

of  the  algorithm  of  Section  4. 1).  Then  the  shortest  path  from  s  to  x  is  the 
concatenation  of  the  shortest  path  from  s  to  v  with  the  segment  vx. 

(b)  The  interiors  of  the  regions  $(e)  are  all  disjoint,  and  the  union  of  these  regions 
is  the  entire  polygon  P.  The  total  number  of  edges  along  their  boundaries  is 
0(n). 

Proof:  The  first  part  of  the  lemma  follows  by  the  same  argtiment  (taken 
from  [LP84])  used  to  justify  the  correctness  of  our  algorithm.  As  to  the 
second  part,  note  first  that  if  $(ei)  and  ^(ej)  had  a  point  x  in  common, 
then  X  would  necessarily  have  two  distinct  shortest  paths  reaching  it 
from  s  (one  for  each  region  $  containing  ar),  which  is  impossible  for  a 
simple  polygon.  As  to  the  second  claim,  let  x  be  any  point  inside  P,  and 
let  A  be  the  triangle  in  the  triangulation  of  P  that  contains  x.  It  is  clear 
that  after  the  above  algorithm  processes  A,  it  will  have  produced  some 
funnel  Fg,  where  e  is  one  of  the  sides  of  A,  such  that  x  lies  between  P. 
and  e.  If  c  is  an  edge  of  P  the  claim  is  immediate;  otherwise,  follow  the 
recursion  of  our  algorithm  from  e  further  on.  Each  of  these  recursive 
steps  takes  a  region  bounded  between  some  funnel  Ft  and  its  associated 
diagonal  t,  splits  it  into  two  subregions,  and  adds  a  portion  of  the  current 
triangle  to  each  of  these  subregions.  If  the  input  region  for  such  a 
step  contains  x,  one  of  the  output  regions  must  also  contedn  x.  Thus 
eventually  our  algorithm  will  have  reached  an  edge  c  for  which  $(e) 
contains  x.  Finally,  because  SPT(P,  s)  is  a  tree  on  n  nodes,  it  has  n  —  1 
edges.  The  regions  $(c)  are  bounded  by  these  n  —  1  edges  plus  the  n 
edges  of  P,  which  proves  the  last  claim  of  (b).  | 

To  produce  the  shortest  path  map  from  the  shortest  path  tree,  we  partition 
each  funnel  with  a  non-empty  interior.  Let  e  be  jui  edge  of  P,  and  let  $(e)  be  the 
corresponding  region  of  P  as  defined  in  Lemma  4.2.  Assume  that  the  funnel  P. 
has  the  form  [u/,u/_i, . . .  ,Ui,a, u;i, . . .  with  a  =  uo  =  as  its  apex  (thus 
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e  =  ujwk)-  Then,  for  each  t  =  0, —  1  (respectively  for  each  i  =  0, . . . ,  A:  —  1),  the 
ray  emanating  from  (respectively  from  Wi)  and  passing  through  u,+i  (respectively 
through  tWi+i)  hits  c,  and  its  portion  between  c  and  u,-  (respectively  Wi)  is  fully 
contained  in  $ (e).  These  rays  partition  $(c)  into  A:  +  /  —  1  disjoint  triangles,  such 
that  each  triangle  has  two  vertices  lying  on  e,  and  its  third  vertex  (called  its  apex) 
belongs  to  the  funnel  Eg.  (See  Figure  4.6.)  The  side  of  the  triangle  opposite  its 
apex  is  its  base,  a  subsegment  of  e.  It  follows  immediately  from  Lemma  4.2(a)  that 
if  a;  €  $(e)  belongs  to  the  triangle  with  apex  q,  then  ^  is  tangent  to  the  funnel 
at  q,  and  thus  the  shortest  path  from  s  to  a;  is  the  concatenation  of  ?r(s,  q)  and  the 
segment  5®* 


w 


Figure  4.6.  Each  funnel  region  is  broken  into  triangles  by  extending  the 
edges  on  the  sides  of  the  funnel. 

The  collection  of  all  triangles  obteuned  this  way  for  all  regions  $(e)  yields  a 
partitioning  of  P  into  disjoint  triangles.  This  subdivision  of  the  polygon  interior  is 
the  shortest  path  map;  we  denote  it  by  5PJlf(P,s).  Because  no  edge  of  SPM(P,s) 
lies  on  the  boundary  of  more  than  two  regions  $(e).  Lemma  4.2(b)  implies  that  the 
shortest  path  map  has  0{n)  triangles  bounded  by  a  total  of  0{n)  edges.^  Although 
the  shortest  path  map  partions  the  interior  of  P  into  triangles,  it  is  generally  not 
a  triangulation.  Polygon  vertices  may  lie  in  the  interior  of  edges  of  the  triangular 
regions.  Furthermore,  although  the  apex  of  each  triangular  region  is  a  vertex  of  P, 

^By  an  argument  that  relates  the  number  of  single-edge  funnels  in  the  shortest  path  tree  to  the 
number  of  vertices  on  the  sides  of  funnels,  we  can  show  that  the  shortest  path  map  has  at  most 
3n  —  6  edges.  This  bound  is  tight. 
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the  other  two  vertices  of  the  triangle  need  not  be  polygon  vertices.  They  may 
instead  be  intersections  of  the  funnel  base  with  extended  edges  from  the  funnel 
sides. 

We  can  use  the  shortest  path  map  to  answer  shortest  path  queries  for  arbitrary 
points  in  P.  To  do  so,  we  use  one  of  the  standard  lineeir-time  algorithms  to  pre- 
process  SPM{P,s)  into  a  data  structure  that  supports  O(logn)-time  point  location 
queries  [Kir83,EGS86].  Given  a  query  point  x  in  P,  we  can  find  the  triangle  or  edge 
of  SPM{P,  s)  that  contains  it  in  O(log  n)  time,  and  thereby  find  the  last  vertex  of  P 
on  the  shortest  path  from  s  to  x,  the  apex  q  of  the  containing  triangle.  If  we  store 
at  each  apex  q  the  length  of  the  shortest  path  from  s  to  q,  we  can  then  calculate  the 
length  of  the  shortest  path  from  s  to  x  in  0(1)  additional  time;  the  path  itself  can 
be  calculated  in  0(1)  additional  time  per  segment  on  the  path  by  simply  traversing 
the  path  in  the  shortest  path  tree  from  q  to  3.  Thus  we  have  the  following  result: 

Theorem  4.2.  Given  a  triangulated  simple  polygon  P  with  n  sides  and  some 
source  point  s  within  P,  one  can  preprocess  P  in  linear  time  so  that,  for  any 
target  point  x  in  P,  the  length  of  the  shortest  path  from  s  to  x  can  be  calculated 
in  O(log  n)  time,  and  the  path  itself  can  be  extracted  in  time  O(log  n  -H  k),  where  k 
is  the  number  of  segments  from  which  this  path  is  composed. 

If  we  want  to  extract  the  path  in  time  proportional  to  the  number  of  ttirns  it 
makes  (rather  than  the  number  of  segments  along  it),  we  change  the  shortest  path 
tree  pointers  so  that  each  vertex  v  stores  a  pointer  to  the  vertex  x  on  7r(s,  v)  farthest 
from  V  such  that  all  vertices  on  7r(x,u)  lie  on  the  segment  xv. 

Remark:  Related  work  on  shortest  paths  inside  a  simple  polygon  can  be  found 
in  [El  85]. 

4.3  Visibility  within  a  simple  polygon 

In  this  section  we  study  a  collection  of  problems  involving  visibility  within  a  simple 
polygon.  These  problems  have  been  studied  in  recent  papers  by  several  authors 
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[EA81,Lee83,AT81,CG85,El  84,LL84,Asa84],  and  various  algorithms  have  been  de¬ 
veloped  to  solve  them.  Some  of  the  simpler  problems  already  have  linear-time  so¬ 
lutions,  whereas  others  only  have  0(n  log  n)  solutions.  Here  we  present  linear-time 
solutions  for  all  these  problems,  again  based  on  the  availability  of  a  triangulation 
of  P.  Our  approach  relies  on  an  interesting  relationship  between  visibility  and 
shortest  path  problems. 

Let  P  be  a  triangulated  simple  polygon  with  n  sides.  The  problems  studied  in 
this  section  and  solved  in  linear  time  are 

I.  Given  a  point  x  inside  P,  calculate  the  visibility  polygon  Vis(P,  x)  consist¬ 
ing  of  all  points  y  6  P  that  axe  visible  from  x  (that  is,  such  that  the  open 
segment  xy  lies  in  the  interior  of  P).  (This  problem  is  simpler  than  the  subse¬ 
quent  ones,  and  there  exist  known  linear-time  algorithms  for  it  [EA81,Lee83].) 

II.  Given  a  segment  c  inside  P,  calculate  the  (weak)  visibility  polygon  Vis(P,  e) 
consisting  of  all  y  €  P  that  are  visible  from  some  point  on  e.  (An  0{n  log  n) 
solution  is  given  in  [CG85];  Avis  and  Toussaint  [AT81]  present  a  linear  time 
algorithm  for  determining  whether  Fw(P,  c)  =  P.) 

III.  Given  a  segment  c  inside  P,  preprocess  P  so  that  for  each  query  ray  r  emanat¬ 
ing  from  some  point  on  e  into  P,  the  first  intersection  of  r  with  the  boundary 
of  P  can  be  calculated  in  O(logn)  time.  (An  0(n  log  n)  solution  is  given 
in  [CG85].) 

IV.  Given  a  segment  e  inside  P,  preprocess  P  so  tnat  for  each  query  point  x  €  P, 
the  subsegment  of  c  visible  from  x  can  be  calculated  in  0(log  n)  time.  (An 
0(n log n)  solution  is  given  in  [CG85].) 

We  shall  first  consider  Problem  I;  although  this  problem  already  has  linear  time 
solutions,  it  is  worthwhile  to  sketch  our  solution  as  a  preparatory  step  toward  the 
solution  of  the  more  complicated  problems  II-IV.  It  is  well  known  that  the  boimdary 
of  Fm(P,  x)  is  a  simple  polygon;  each  of  its  vertices  is  either  a  vertex  of  P  that  is 
visible  from  x  or  a  “shadow”  cast  on  the  boundary  of  P  by  such  a  visible  vertex. 
A  shadow  vertex  y  terminates  the  visible  portion  of  a  partially  visible  polygon 
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segment;  that  is,  y  is  one  endpoint  of  a  subsegment  whose  entire  interior  is  visible 
from  X.  The  segment  xy  passes  through  a  vertex  of  P;  see  Figure  4.7. 


Figure  4.7.  Visibility  of  a  polygon  from  a  point 


Suppose  without  loss  of  generality  that  x  is  a  vertex  of  P  (if  not,  it  is  easy  to 
construct  in  linear  time  a  triangulation  of  the  interior  of  P  in  which  x  is  also  a  vertex 
of  some  triangle).  Calculate  the  shortest  path  map  SPM(P,x)  &om  x  using  the 
algorithm  given  in  Section  4.2.  In  SPM(P,  x),  the  triangular  regions  with  x  as  their 
apex  include  all  the  points  inside  P  that  are  visible  from  x.  The  polygon  vertices 
connected  to  x  by  edges  of  SPM{P,x)  include  all  the  vertices  visible  from  x.  (Two 
of  the  neighbors  of  x  in  SPM(P,x)  are  not  visible  from  x:  edges  of  SPM{P,x) 
connect  x  to  its  neighbors  on  the  boundary  of  P,  but  our  definition  of  visibility 
does  not  count  these  as  visible  points.)  To  calculate  Vis{P,x),  we  concatenate  in 
clockwise  order  the  regions  of  SPM{P,  x)  with  x  as  their  apex,  thereby  obtaining  a 
single  simple  polygon  that  is  the  boundary  of  Vis{P,  x).  Thus  we  have  the  following 
theorem: 

Theorem  4.3.  The  visibility  polygon  Vis(P,x)  can  be  calculated  in  linear  time, 
given  a  triangulation  of  P. 

Next  we  consider  Problem  II.  Let  a,  b  be  the  endpoints  of  e.  It  is  easily  checked 
that  in  this  case  Vis{P,  e)  is  a  simple  polygon  each  of  whose  vertices  is  either 


(i)  a  vertex  of  P  visible  from  e;  or 
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(ii)  a  shadow  cast  on  the  boundary  of  P  by  a  ray  that  emanates  from  a  or  from  b 
and  passes  through  a  vertex  of  P  visible  from  that  endpoint;  or 

(iii)  a  shadow  cast  by  a  ray  r  that  emanates  from  some  interior  point  on  e  and 
passes  through  two  vertices  x,  y  of  P,  such  that  the  exterior  of  P  lies  on  one 
side  of  r  in  the  vicinity  of  x  and  on  the  other  side  of  r  in  the  vicinity  of  y. 

We  characterize  Vis{P,  e)  further  in  Section  5.1.  See  Figure  4.8  for  an  illustration 
of  7m(P,c). 


Figure  4.8.  Visibility  of  a  polygon  from  an  edge 

Let  e'  =  c3  be  another  edge  of  P.  We  say  that  the  shortest  path  7r(a,  c)  is  inward 
convex  if,  as  in  the  case  of  funnels,  the  convex  angles  formed  by  successive  segments 
of  this  path  with  the  directed  line  ab  keep  increasing.  A  symmetric  definition  of 
inward  convexity  applies  to  ir(6,  d). 

Lemma  4.3.  H  c'  contains  a  point  in  its  interior  that  is  visible  from  e  then  (up  to 
exchanging  c  and  d)  the  two  paths  7r(a,  c)  and  7r(6,  d)  are  inward  convex. 

Proof:  Let  x  6  e'  be  visible  from  some  point  z  on  c.  Suppose  without 
loss  of  generzdity  that  c  is  that  endpoint  of  e'  for  which  a  and  c  lie  on  the 
same  side  of  the  line  xz.  Then  the  shortest  path  7r(a,  c)  must  lie  entirely 
on  one  side  of  xz,  and  indeed  it  does  not  cross  the  polygonal  path  azxc. 

Since  the  zirea  R  between  azxc  and  7r(a,  c)  is  fully  contained  in  P,  it 
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follows  that  ir(a,c)  must  be  inward  convex,  or  else  we  could  shortcut 
it  by  a  segment  contained  in  R,  thus  also  in  P.  The  claim  concerning 
Tr(b,d)  follows  by  a  symmetric  argument.  | 

In  the  situation  described  by  the  preceding  lemma,  we  call  the  union  of  ir(a,  c) 
and  7r(6,  d)  the  hourglass  for  the  pair  (e,  c'). 

Suppose  we  apply  the  shortest-path  algorithm  of  Section  4.1  to  the  two  source 
vertices  a  and  b,  and  also  compute,  for  each  vertex  c  of  P,  whether  the  path  x(a,  c) 
(respectively  ir{b,c))  is  inward  convex.  (The  latter  calculations  take  0(1)  time  per 
vertex.)  Let  e'  =  cd  be  another  edge  of  P.  If  the  two  paths  7r(a,  c)  and  ir{b,  d)  are 
not  both  inward  convex,  and  also  the  two  paths  7r(a,  d)  and  ir(6,  c)  are  not  both 
inward  convex,  then  by  Lemma  4.3,  e'  is  not  visible  from  e.  Thus  suppose  (without 
loss  of  generality)  that  the  two  paths  ir(a,c)  and  Tr(b,d)  are  inward  convex.  It 
is  easy  to  see  that  the  shortest  path  7r(a,  d)  must  be  the  concatenation  of  three 
subpaths:  a  subpath  ir(a,x)  of  7r(a,c),  where  x  is  a  point  lying  on  ir(a,c),  a  line 
segment  xy,  where  y  is  a  point  lying  on  irib^d),  and  the  subpath  v{y,d)  of  ir(b,d). 
Moreover  xy  must  be  a  common  tangent  to  both  of  the  paths  7r(a,  c)  and  x(6,  d)  (see 
Figmre  4.9).  The  path  7r(6,c)  has  a  symmetric  structure  of  the  form  ir(b,w)  ||  wz  || 
7r{z,  c)  (where  ||  denotes  path  concatenation),  for  appropriate  points  w  €  ir(ft,  d), 
z  6  7r(a,  c).  It  follows  that  the  subsegment  of  e'  visible  from  e  is  that  delimited 
by  the  intersections  of  e'  with  the  two  lines  ^  zind  wz.  (If  xy  and  wz  intersect  e' 
in  the  same  point,  then  no  point  on  e'  is  visible  from  e.)  Note  also  that,  in  the 
terminology  of  Section  4.1,  when  the  shortest-path  algorithm  is  run  with  a  as  the 
source,  the  funnel  Pe'  associated  with  the  segment  e'  has  x  as  its  apex,  emd  y  as  an 
adjacent  vertex.  Similarly,  when  the  algorithm  runs  with  b  as  the  source,  w  is  the 
apex  of  the  funnel  p.'  a^d  z  is  an  adjacent  vertex  in  that  fuimel. 

These  observations  suggest  a  straightforward  method  for  calculating  the  points 
X,  y,  w  and  z.  Namely,  as  we  execute  the  shortest  path  algorithm  with  a  as  a  source, 
and  we  reach  an  edge  e'  i=  cd  of  P  for  which  the  path  x(a,  c)  is  inward  convex,  we 
take  X  to  be  the  apex  of  the  current  funnel,  and  y  to  be  the  next  vertex  on  the  part  of 
the  funnel  between  x  and  d.  The  points  w  and  z  are  found  in  a  completely  symmetric 
manner  when  running  the  shortest  path  algorithm  with  6  as  a  source.  To  calculate 
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e’  d 


Figure  4.9.  Visibility  of  one  edge  of  a  polygon  from  another,  and  the 
corresponding  hourglass 

Vis(Pye)  we  simply  traverse  the  boimdary  of  P,  say  in  clockwise  order,  collecting 
all  visible  subsegments  along  this  boundary  and  replacing  contiguous  non-visible 
portions  of  the  botmdary  by  straight  segments  connecting  visible  vertices  with  their 
shadows.  Thus  we  have  the  following  theorem: 

Theorem  4.4.  The  visibility  polygon  Vis{P,  e)  of  P  with  respect  to  an  edge  e  can 
be  calculated  in  linear  time  (assuming  a  triangulation  of  P  is  given). 

Remark:  Toussaint  has  independently  obtained  a  similar  connection  between  short¬ 
est  paths  and  visibility  inside  a  simple  polygon,  as  well  as  some  of  the  technical  tools 
developed  above  [Tou85]. 

Next  let  us  consider  Problem  IV.  Let  x  6  P  be  an  arbitrary  point  that  is  visible 
from  some  point  z  €  e.  An  immediate  generalization  of  Lemma  4.3  implies  that 
both  of  the  paths  7r(a,  x)  and  7r(6,  x)  must  be  inward  convex.  A  slight  variant  of  the 
converse  statement  is  also  true,  as  is  easily  checked:  if  both  ir(a,  x)  and  ir(b,  x)  are 
inward  convex  and  intersect  only  at  x,  then  x  is  visible  from  c.  Moreover,  let  the 
last  straight  segment  in  ir(a,  x)  (respectively  in  ir(b,  x))  be  cx  (respectively  dx)  for 
some  vertex  c  (respectively  d)  of  P.  Then  the  portion  of  e  visible  from  x  is  delimited 
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by  the  intersections  of  the  rays  icc  and  xd  with  c;  if  the  intersections  coincide,  e  is 
not  visible  from  x. 

Thus,  to  preprocess  P  as  required  in  Problem  IV,  we  execute  algorithm  of  Sec¬ 
tion  4.2  twice,  once  with  a  as  a  source,  and  once  with  6  as  a  source.  This  yields  two 
shortest  path  maps,  SPM(P,a)  and  SPM{P,b),  which  we  then  further  preprocess 
into  two  data  structures  that  support  O(log  n)  point  location  queries.  This  can  be 
done  in  linear  time,  using  the  techniques  in  [Kir83]  or  [EGS86].  In  addition,  we 
store  at  each  vertex  c  of  P  flags  indicating  whether  the  paths  ir{a,c)  and  ^r(6,  c) 
axe  inward  convex,  and  (as  usual)  also  pointers  to  the  two  parents  of  c  in  the  two 
shortest-path  trees  produced  by  the  two  runs  of  the  algorithm. 

Now  let  a:  6  P  be  a  given  query  point.  First  we  locate  x  in  the  two  subdivisions 
SPM(P,a)  and  SPM(P,b)  and  obtain  the  apexes  of  the  two  enclosing  triangles, 
vertices  c  and  d  of  P.  Then,  in  additional  constant  time,  we  can  test  whether  the 
paths  7r(a,  x)  =  7r(a,  c)  ||  cx,  ir{b,  x)  =  ir(b,  d)  ||  dx  are  both  inward  convex,  and  if 
so,  find  the  intersections  of  the  rays  xc  and  xd  with  e,  thereby  obtaining  the  desired 
subsegment  of  e  that  is  visible  flrom  x. 

Remark:  Problem  IV  is  more  general  than  the  corresponding  problem  P3  studied 
in  [CG85],  in  that  here  the  query  point  x  can  be  any  point  inside  P,  whereas 
in  [CG85]  it  is  required  to  lie  on  the  boundary  of  P. 

Finally,  we  consider  Problem  III.  Here  we  make  use  of  the  duality  between  rays 
emanating  from  e  emd  points  in  the  two-sided  plane  (2SP  for  short),  as  described 
in  [GRS83,CG85].  Following  [CG85],  we  will  produce  a  partitioning  11  of  the  2SP 
into  convex  regions,  each  region  containing  the  duals  of  all  rays  emanating  from  e 
and  hitting  the  same  edge  of  P.  The  same  partitioning  is  obtained  in  [CG85]  in 
0(n  log  n)  time;  we  show  here  how  to  obtain  it  in  linear  time. 

To  this  end  we  make  use  of  the  analysis  of  Problem  II  given  above.  Let  e'  —  cd 
be  another  edge  of  P  that  contains  points  visible  from  e.  As  above,  we  know  that 
the  two  paths  7r(a,c),  ir(b,d)  axe  inward  convex.  Let  x^,  wz  be  the  two  common 
tangents  to  these  paths,  where  x,  r  G  and  w,y  G  ir(b,d).  Let  P(e')  be  the 

region  in  11  corresponding  to  e^  Then  the  boundary  of  consists  of  points  that 
are  duals  of  rays  r  emanating  from  e  and  hitting  points  on  e' ,  such  that  r  passes 
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through  a  vertex  of  P  (which  can  be  either  one  of  the  endpoints  a,  b,  c,  d,  or  another 
vertex  of  P  that  r  “grazes”  on  its  way  from  e  to  e'.  It  follows  from  the  preceding 
analysis  that  such  a  vertex  must  lie  on  one  of  the  paths  7r(a,  c),  ir{b,d),  or,  more 
precisely,  on  one  of  their  subpaths  7r(a;,  z),  7r(ti>,  y). 

Moreover,  each  vertex  of  R{e')  corresponds  either  to  a  ray  that  passes  through 
two  vertices  of  P  that  are  adjacent  in  one  of  the  subpaths  Tr(x,  z),  y),  or  to  one 

of  the  two  extreme  rays  xy,  wz.  It  is  also  easy  to  establish  adjacency  of  the  vertices 
of  R(e')  along  its  boundary.  Specifically,  two  such  vertices  must  correspond  to  two 
rays  passing  respectively  through  fg  and  y/i,  where  /,  y,  h  are  three  vertices  of  P 
that  are  either  adjacent  along  one  of  the  paths  7r(x,  z),  ?r(iy,  y),  or  are  such  that  g  is 
one  of  w,  X,  y,  or  z,  /  is  adjacent  to  g  along  xy  or  uTz,  and  h  is  adjax:ent  to  g  along 
7r(x,  z)  or  ir{w,  y). 

The  preceding  arguments  imply  that  the  toted  number  of  vertices  in  11  is  at 
most  proportional  to  the  sum  of  the  sizes  of  the  funnels  for  the  edges  of  P  that  are 
obtained  during  execution  of  the  shortest  path  algorithm  of  Section  4.1.  This  sum 
is  linear  in  n,  which  implies  that  11  has  only  0(n)  vertices  (see  also  [CG85]). 

It  is  also  easy  to  calculate  adjacency  of  regions  in  11.  Specifically,  it  suffices  to 
consider  adjacency  of  regions  near  a  vertex  r  of  11.  By  the  preceding  emalysis,  r  is 
the  dual  of  a  ray  r  emanating  from  e  and  passing  through  two  vertices  y,  h  of  P.  We 
can  apply  a  simple  local,  though  somewhat  lengthy,  case  analysis  (which  requires 
only  0(1)  time),  to  enumerate  all  possible  pairs  of  edges  e',  e"  whose  regions  P(e'), 
P(e'')  in  n  are  adjacent  near  r;  generally,  each  of  these  edges  will  either  lie  adjacent 
to  y  or  /i,  or  contain  one  of  the  endpoints  of  r,  if  this  endpoint  is  different  from  y 
and  h.  We  leave  details  of  this  case  analysis  to  the  reader. 

All  these  observations  imply  that  11  can  be  calculated  in  linear  time  from  the 
output  of  the  executions  of  the  shortest  path  algorithm  of  Section  4.1  with  a  and  b 
as  sources.  Having  calculated  H,  we  next  apply  to  it  one  of  the  linear-time  pre¬ 
processing  algorithms  for  point  location  [Kir83,EGS86],  obtaining  a  data  structure 
from  which  the  region  of  11  containing  (the  dual  of)  any  query  ray  r,  Eind  thus  also 
the  edge  of  P  first  hit  by  r,  can  be  found  in  O(log  ra)  time. 

Remark:  The  case  analysis  in  the  construction  of  11  can  be  avoided  if  we  work 
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in  dual  space  from  the  beginning,  as  in  [CG85].  The  rays  passing  through  edge  e 
dualize  to  a  convex  region  of  the  two-sided  plane.  We  explore  the  triangulation  one 
triangle  at  a  time,  as  in  Section  4.1;  instead  of  splitting  funnels,  we  subdivide  convex 
polygonal  regions  in  dual  space.  Finger  search  trees  can  be  used  as  in  Section  4.1 
to  mahe  the  overall  construction  time  hnear. 


4.4  Conclusion 

This  chapter  has  presented  a  collection  of  linear-time  algorithms  for  solving  a  va¬ 
riety  of  shortest  path  and  visibility  problems  inside  a  triangulated  simple  polygon, 
exploiting  interesting  relationships  between  these  two  types  of  problems.  AH  the 
algorithms  axe  based  on  a  single  structure,  the  shortest  path  tree.  One  of  the  most 
useful  derivatives  of  the  shortest  path  tree  is  the  shortest  path  map,  which  rep¬ 
resents  all  the  shortest  paths  from  a  fixed  source  vertex  to  other  points  in  P.  In 
the  following  chapter  we  use  the  shortest  path  map  as  a  tool  to  build  the  visibility 
graph  of  a  simple  polygon. 


Chapter  5 


Finding  the  Visibility  Graph  of  a 
Simple  Polygon  in  Time 
Proportional  to  its  Size 


Write  the  vision,  and  make  it  plain  upon  tables, 
that  be  may  run  that  readetb  it 

—  Habakkuk  2:2 


As  described  in  Chapter  2,  the  visibility  graph  of  a  set  S’  of  n  non-intersecting  line 
segments  in  the  plane  records  each  pair  of  segment  endpoints  that  can  be  connected 
by  an  open  segment  free  of  intersections  with  S.  If  we  regard  the  segments  as 
opaque  walls,  the  visibility  graph  records,  for  every  wall  endpoint  v,  what  other 
wall  endpoints  would  be  seen  by  an  observer  standing  at  v.  As  its  name  suggests, 
the  visibility  graph  is  a  combinatorial  graph  structure;  its  nodes  axe  the  segment 
endpoints,  and  its  edges  connect  mutually  visible  segment  endpoints. 

Although  the  visibility  graph  algorithms  of  Chapter  2  find  the  visibility  graph  in 
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0(n*)  time,  the  visibility  graph  itself  may  be  as  small  as  0(n).  This  seems  to  suggest 
some  inefficiency  in  the  algorithms;  a  linear  amount  of  work  is  being  expended  to 
find  each  visibility  graph  edge.  It  would  be  desirable  to  have  an  algorithm  whose 
running  time  depended  on  the  size  of  its  output. 

It  is  possible  to  improve  on  the  0(n^)  algorithms  when  5  has  some  special 
property.  For  example,  when  the  segments  in  S  form  a  collection  of  k  convex 
polygons,  it  is  not  hard  to  find  the  visibility  graph  in  time  proportional  to  the 
output  size  plus  0{nklogn).  This  gives  an  log^^^ n)  best-case  algorithm. 

In  this  chapter  we  consider  the  case  in  which  the  segments  form  a  simple  polygon 
with  no  consecutive  collinear  edges.  By  convention,  the  visibility  graph  of  a  simple 
polygon  P  contains  only  the  visibility  edges  inside  P.  If  we  need  the  complete 
visibility  graph,  we  can  construct  it  by  computing  the  convex  hull  of  P,  then  finding 
the  visibility  graph  inside  P  and  inside  each  of  the  bays  cut  off  by  non-polygon  edges 
of  the  hull.  We  define  m  to  be  the  number  of  visibility  edges  inside  P.  Because  P 
can  be  triangulated,  m  is  at  least  n  —  3. 

A  recent  algorithm  of  Stiri  finds  the  visibility  graph  of  a  simple  polygon  in  time 
0(m  log  n)  [Sur86b].  The  method  uses  0(m)  “shooting”  queries:  given  a  direction  d 
and  a  point  q  inside  P,  each  query  asks  for  the  first  point  of  P  intersected  by  the 
ray  from  q  with  direction  d.  Each  query  takes  O(logn)  time  to  answer  [CG85]. 

This  chapter  shows  how  to  find  the  visibility  graph  of  a  simple  polygon  P  in 
time  0(m  -|-  n  log  log  n).  The  algorithm  exploits  the  close  relationship  between 
the  visibility  graph  zind  shortest  paths  inside  P:  shortest  paths  follow  visibility 
edges,  and  visibility  edges  are  shortest  paths  between  the  points  they  connect. 
The  algorithm  computes  all  the  shortest  paths  from  each  polygon  vertex  to  other 
vertices,  then  picks  out  the  single-edge  paths  as  visibility  edges. 

At  first  glance,  this  approach  seems  extraordinarily  inefficient.  Why  should 
we  compute  shortest  paths  with  many  turns  when  only  the  single-edge  paths  are 
important?  The  answer  is  that  the  set  of  all  shortest  paths  has  more  structure  than 
the  set  of  single-edge  paths.  Once  we  have  found  the  set  of  all  shortest  paths  from 
one  vertex,  we  can  exploit  the  structure  to  find  similar  sets  for  the  other  vertices 
efficiently. 
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For  each  polygon  vertex  v,  we  use  the  shortest  path  map  SPM{P,v)  to  repre¬ 
sent  the  shortest  paths  from  v  to  the  other  vertices.  As  noted  in  Section  4.3,  the 
vertices  visible  from  v  are  its  neighbors  in  SPM{P,v)’,  given  a  reasonable  represen¬ 
tation  of  SPM (P,  v),  we  can  find  them  in  constant  time  apiece.  If  we  constructed 
each  shortest  path  map  individurdly,  we  would  spend  0(n)  time  on  each  one,  and 
it  would  take  0(n^)  time  to  find  the  visibility  graph.  However,  we  can  save  time 
by  building  the  shortest  path  map  for  an  arbitrarily  selected  vertex  vi ,  then  trans¬ 
forming  it  to  get  all  the  other  shortest  path  maps.  Once  we  have  SPM(P,vi),  we 
modify  it  to  produce  SPM{P,v-i)^  where  V2  is  the  coimterclockwise  neighbor  of  vi 
along  the  polygon  boundary.  We  repeat  this  operation  for  all  the  vertices,  working 
cormterclockwise  around  the  polygon. 

We  build  the  first  shortest  path  map  by  triangulating  P,  then  running  the 
algorithm  of  the  preceding  chapter.  The  triangulation  algorithm  of  Tarjan  and 
Van  Wyk  takes  O(nloglogn)  time  [TV86],  which  dominates  the  initialization  cost. 

In  the  rest  of  this  chapter,  we  give  algorithms  to  produce  all  the  shortest  path 
maps  from  SPM{P,Vi)  in  0(m)  time.  Section  5.1  reveals  the  similarity  between 
shortest  path  maps  for  adjacent  vertices  by  characterizing  their  differences.  Sec¬ 
tion  5.2  gives  two  algorithms  for  obtaining  one  such  shortest  path  map  from  the 
other  in  time  proportional  to  the  number  of  differences  between  them.  Section  5.3 
shows  that  the  total  number  of  differences  between  adjacent  shortest  path  maps  is 
0(m),  which  implies  that  our  algorithm  takes  0{m  +  n  log  log  n)  time  altogether. 
This  is  slightly  suboptimal  when  m  =  o(n  log  log  n),  but  it  is  still  a  significant 
improvement  over  earlier  methods. 


5.1  Shortest  path  maps 

Shortest  path  maps  were  introduced  in  Section  4.2;  in  this  section  we  give  additional 
properties  of  shortest  path  maps  that  we  need  for  our  visibility  graph  algorithm. 
Since  our  algorithm  transforms  one  shortest  path  map  into  another,  we  characterize 
the  differences  between  two  shortest  path  maps  whose  sources  axe  adjacent  polygon 
vertices.  Let  us  introduce  some  notation  to  help  talk  about  shortest  paths.  We 
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have  already  defined  ir(s,v)  to  be  the  unique  shortest  path  from  s  to  u  that  does 
not  go  outside  P.  We  now  pick  out  a  key  point  on  the  path:  l(s,  v)  is  the  polygon 
vertex  on  7r(s,  v)  that  is  not  equal  to  v  but  closest  to  it  (/  stands  for  the  last  vertex 
on  the  path).  We  assume  in  this  chapter  that  s  is  a  polygon  vertex,  so  l(s,v)  is 
always  well-defined.  For  all  points  v  inside  a  triangular  region  of  the  shortest  path 
map  SPM(P,v),  l(s,v)  is  constant  and  equal  to  the  region’s  apex. 

In  the  rest  of  this  chapter  we  refer  to  shortest  path  maps  repeatedly;  since  all 
our  shortest  path  maps  are  subdivisions  of  P,  we  use  the  notation  SPM{s)  in  place 
of  the  notation  SPM{P,s).  We  represent  5PAf(s)  using  a  standard  subdivision 
representation,  such  as  the  winged-edge  data  structure  of  Batimgart  [Bau75]  or 
the  quad-edge  data  structure  of  Guibas  and  Stolfi  [GS85].  The  data  structure  lets 
us  find  the  neighbors  of  s  in  SPM{s)  in  constant  time  apiece,  which  gives  us  the 
visibility  edges  with  s  as  an  endpoint  in  constant  time  per  edge.  The  algorithms 
of  Section  5.2  must  be  able  to  find  the  base  of  a  region  in  constant  time,  so  we 
augment  the  data  structure  by  storing  at  the  apex  of  each  region  a  pointer  to  its 
base.  When  the  algorithms  need  to  find  the  base  of  a  region,  they  already  know  its 
apex. 

Let  s  and  t  be  adjacent  polygon  vertices,  s  clockwise  of  t.  Given  SPM{s),  the 
algorithms  of  the  following  section  produce  SPM (t)  by  adding  and  deleting  edges. 
The  following  lemma  cheu'acterizes  the  part  of  the  subdivision  that  they  change. 

Lemma  5.1.  Let  s  and  t  be  adjacent  vertices  of  P,  and  let  e  be  an  edge  of  SPM{s) 
that  does  not  belong  to  SPM{t).  Then  there  is  a  point  v  in  the  interior  of  e  such 
that  any  neighborhood  of  v  includes  points  that  are  visible  from  the  interior  of  st. 

Proof:  The  edge  e  separates  two  regions  of  5PM(s),  and  hence  l(s,  a)  ^ 

/(s,  b)  for  a  and  b  on  opposite  sides  of  e.  Because  e  does  not  belong  to 
SPM(t),  we  can  pick  a  point  v  in  the  interior  of  e  that  is  not  part  of  an 
edge  of  SPM{t).  Then  for  a  and  b  on  opposite  sides  of  e  and  sufficiently 
close  to  V,  we  have  /(t,a)  =  /(t,  b).  By  possibly  interchanging  a  and  6, 
we  can  insure  that  l{t,  a)  ^  /(s,  a). 

Now  consider  the  shortest  path  tree  of  a.  Since  ^  l(s,a),  a 
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is  the  apex  of  the  funnel  associated  with  segment  It.  The  funnel  is 
evidence  that  some  part  of  the  interior  of  si  is  visible  from  a.  Any  ray 
with  source  a  and  direction  strictly  in  the  angle  formed  by  /(s,  a),  a, 
and  l{t,  a)  must  hit  the  interior  of  si  before  it  hits  any  other  part  of  P. 

I 

The  part  of  P  visible  from  at  leeist  one  point  in  the  interior  of  s?  is  a  connected 
subpolygon,  the  visibility  polygon  Vm(P,  si).  Following  our  convention  for  shortest 
path  maps,  we  drop  P  from  the  name  and  write  Vis(st)  instead  of  Vis(P,si). 
Lemma  5.1  shows  that  SPM(s)  and  SPM{t)  differ  only  inside  Vis(3i)  and  on  its 
boundary.  The  following  lemma  characterizes  the  boundary  of  Vis  (si). 

Lemma  5.2.  The  boundary  of  V’w(s?)  is  made  up  of  edges  of  SPM{s)  and  SPM(t). 
Any  edge  ab  that  is  on  the  boundary  of  Kw(s<)  but  not  on  the  botmdary  of  P  has 
one  endpoint,  say  a,  that  lies  on  the  paths  to  b  from  s  and  t:  it  is  both  l(s,  b)  and 
l(t,  b).  If  a  is  clockwise  of  6  on  the  boundary  of  P,  then  ab  belongs  to  SPM(s); 
otherwise,  it  belongs  to  SPM(t). 

Proof;  The  edges  of  P  belong  to  both  SPM(s)  and  SPM(t);  thus  the 
polygon  edges  bn  the  botmdary  of  Vis  (si)  belong  to  both  shortest  path 
maps.  We  therefore  consider  only  botmdary  edges  of  Vis  (si)  in  the 
interior  of  P. 

A  point  V  in  the  interior  of  such  an  edge  is  not  visible  from  it,  since 
it  is  not  in  the  interior  of  Vis(H).  As  noted  in  the  proof  of  Lemma  5.1, 
we  must  have  l(s,v)  =  l(t,v).  Let  us  define  a  to  be  l{s,v).  The  point  a 
is  an  endpoint  of  the  edge  that  contains  u;  let  6  be  the  other  endpoint. 

The  shortest  paths  from  6  to  s  and  t  must  go  through  a,  and  so  a  is 
both  /(s,  6)  and  l{t,  b). 

Suppose  that  a  is  clockwise  of  b.  This  means  that  the  interior  of 
Vis(si)  lies  to  the  left  of  ab.  Let  v'  be  a  point  just  to  the  left  of  v,  inside 
Vis(si).  (See  Figure  5.1.)  Because  u'  is  visible  from  st,  the  two  shortest 
paths  x(s,v')  and  7r(t,v')  intersect  only  at  u'.  Vertex  a  is  on  or  to  the 
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right  of  ir(t,  v'),  so  l(s,  v')  ^  a.  This  holds  for  v'  distinct  from  v  but 
arbitrarily  close  to  it.  Since  /(s,  u)  =  a,  the  segment  ab  is  an  edge  of 
5PM(s);  it  separates  two  regions  of  SPM{s). 

b 


s  t 

Figure  5.1.  Because  v*  is  visible  from  st  and  v  is  not,  l(s,v)  ^  l{3,v'). 

This  implies  that  ab  is  an  edge  of  SPM(s). 

If  a  is  counterclockwise  of  b,  then  the  interior  of  Viw(3?)  lies  to  the 
right  of  ab.  An  argument  similar  to  the  preceding  one  shows  that  ab  is 
an  edge  of  SPM (t).  | 

5.2  Constructing  shortest  path  maps 

This  section  presents  two  dissimilar  ways  to  implement  the  general  step  of  the 
algorithm,  the  construction  of  one  shortest  path  map  from  that  of  an  adjacent 
vertex.  Both  methods  take  time  proportional  to  the  number  of  differences  between 
the  two  shortest  path  maps.  Section  5.3  shows  that  the  number  of  differences, 
summed  over  all  neighbor  pairs,  is  proportional  to  the  size  of  the  visibility  graph. 
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5.2.1  A  sweeping-path  algorithm 

The  first  transformation  algorithm  might  be  called  a  “sweeping-path”  algorithm. 
It  moves  a  point  p  from  t  to  s  along  the  boundary  of  the  subpolygon  V^w(st),  the 
region  in  which  one  shortest  path  map  must  be  replaced  by  another.  We  think 
of  the  point  as  being  connected  to  s  and  t  by  two  rubber  bands  inside  P,  the 
shortest  paths  7r(s,p)  and  ?r(t,p).  As  the  point  moves,  it  drags  the  two  paths  along 
with  it,  and  the  paths  sweep  over  the  subdivision  inside  Vt3(5t).  The  algorithm 
uses  the  sweeping  paths  to  change  5P Jlf(s)  into  SPM{t).  The  algorithm  maintains 
the  following  invariant:  on  and  to  the  right  of  ir(t,p),  the  subdivision  represents 
SPM(t);  to  the  left  of  ir(s,p),  the  subdivision  represents  SPM{s);  between  the  two 
paths,  there  are  no  edges. 

The  algorithm  updates  the  subdivision  when  p  visits  vertices  of  SPM(s)  or 
SPM{t).  If  v  is  a  vertex  of  5PJlf(s),  all  the  edges  of  7r(s,t;)  are  edges  of  SPM{s). 
When  p  visits  v,  the  algorithm  deletes  the  last  edge  of  the  path  ir(s,v),  unless  it  is 
also  an  edge  of  SPM{t).  (An  edge  may  belong  to  both  shortest  path  maps  only  if 
it  is  a  boundary  edge  of  Vm(5?).)  Similarly,  when  p  visits  a  vertex  v  of  SPM(t), 
the  algorithm  adds  the  last  edge  of  ir(<,w)  to  the  subdivision,  imless  it  is  already 
an  edge  of  SPM(s).  This  procedure  considers  just  once  each  edge  that  must  be 
changed. 

When  the  algorithm  adds  or  deletes  an  edge,  it  may  have  to  make  other  changes 
to  the  subdivision.  Suppose  that  the  algorithm  deletes  an  edge  of  SPM{s)  when  p 
visits  V.  If  V  is  a  vertex  of  5PJl/(s)  but  not  of  P,  then  v  splits  a  polygon  edge;  the 
algorithm  must  merge  the  two  pieces  back  into  one.  Similarly,  when  the  algorithm 
adds  an  edge  of  SPM{t)  to  the  subdivision,  it  may  have  to  split  a  polygon  edge: 
if  p  is  not  at  a  polygon  vertex,  the  algorithm  splits  the  polygon  edge  on  which  p 
lies.  Since  adding  and  deleting  edges  changes  regions  in  the  current  subdivision,  we 
must  update  the  base  pointer  in  each  changed  region.  This  is  not  hard:  whenever 
a  pointer  needs  to  be  changed,  the  new  base  is  incident  to  p,  and  the  region’s  apex 
is  connected  to  p  by  a  straight  segment  of  7r(s,p)  or  7r(<,p). 

How  do  we  represent  p  and  the  two  paths  in  a  discrete  algorithm?  The  point  p 
does  not  move  continuously  along  the  boundary  of  Vis(si),  but  in  discrete  jumps 
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from  one  vertex  of  SPM (s)  or  SPM (t)  to  the  next  such  vertex.  Because  the  shortest 
paths  to  intermediate  vertices  on  7r(s,p)  and  Tr{t,p)  are  unique,  the  movement  of  p 
changes--7r(s,p)  and  7r(t,p)  only  at  the  ends  nearer  p.  Thus  we  can  represent  the 
two  paths  as  stacks  of  polygon  vertices.  The  stacks  do  not  include  p:  the  top  of  the 
stack  for  x'(s,p)  is  l(s,p),  and  the  top  of  the  stack  for  ‘K(t,p)  is  l(t,p). 

We  have  described  the  basic  idea  of  the  algorithm,  as  well  as  its  data  structures. 
We  now  describe  the  actions  necessary  to  move  p  along  the  boundary  of  Vis(^. 
These  can  be  broken  down  into  several  interrelated  tasks.  The  hardest  two  tasks 
are  deciding  which  edge  p  should  follow  when  it  leaves  a  vertex  and  deciding  when 
it  should  stop  moving  along  the  edge.  By  referring  to  the  two  shortest  paths  Jr(s,p) 
and  7r(t,p),  the  algorithm  can  make  these  decisions  in  constant  time  apiece.  A 
simpler  task  is  that  of  maintaining  the  stacks  that  represent  the  shortest  paths.  We 
show  how  to  perform  these  tasks,  taking  into  account  the  possibility  of  degeneracies 
in  P.  We  then  combine  the  costs  of  these  operations  to  bound  the  algorithm’s 
running  time. 

The  region  visible  from  the  interior  of  st,  Vis(sF),  is  bounded  by  polygon  edges 
and  by  extension  edges  of  the  types  shown  in  Figure  5.2.  (An  extension  edge  is 
a  shortest  path  map  edge  that  does  not  belong  to  the  shortest  path  tree.  It  is 
obtained  by  extending  a  funnel  edge.)  The  algorithm  for  following  the  boimdary  of 
Vis  (si)  is  simple:  p  moves  comterclockwise  along  the  polygon  boimdary  whenever 
doing  so  leaves  both  shortest  paths  7r(s,p)  and  x{t,p)  inward  convex.  When  moving 
along  the  polygon  boundary  would  cause  one  of  these  two  paths  to  violate  inward 
convexity,  as  at  vertices  v  and  w  in  Figure  5.2,  p  moves  along  an  extension  edge. 
It  moves  along  edges  of  SPM{s)  when  moving  away  from  si  and  along  edges  of 
SPM(t)  when  moving  toward  st,  as  specified  by  Lemma  5.2.  In  Figiure  5.2,  when  p 
advances  from  v,  it  moves  along  the  extension  of  the  edge  from  l{s,v)  through  v. 
When  p  advances  from  w,  it  moves  along  the  extension  edge  between  w  and  l(t,w). 

The  movement  of  p  fits  in  with  the  edge  addition  and  deletion  scheme  outlined 
above.  Whenever  p  moves  along  an  extension  edge  of  5PJlf  (s),  the  edge  still  belongs 
to  the  subdivision.  When  p  moves  along  an  extension  edge  of  SPM (t),  the  edge  has 
just  been  added  to  the  subdivision. 
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Figure  5.2.  Vis  (st)  is  bounded  by  polygon  edges  and  by  extension  edges. 

The  left  extension  edge  is  an  edge  of  SPM (t);  the  right  edge  belongs  to 
SPM(s). 

We  can  tell  which  edge  p  should  follow  when  it  leaves  a  vertex;  we  need  to 
know  when  to  stop  its  motion  along  the  edge.  We  break  the  problem  into  two 
subproblems.  We  consider  moving  p  along  extension  edges  below;  here  we  show 
how  to  move  p  along  edges  of  P  that  are  visible  from  si.  When  moving  along  such 
an  edge,  p  should  stop  at  the  first  vertex  of  SPM{s)  or  SPM{t)  on  the  edge.  It  is 
easy  to  detect  vertices  of  SPM(s):  they  are  vertices  of  the  subdivision  left  of  7r(s,p). 
Recognizing  vertices  of  SPM(t)  that  axe  not  vertices  of  SPJlf(s)  is  more  difficult. 

The  problem  we  need  to  solve  is  quite  specific:  given  a  point  p  on  the  boundary 
of  FM(st)  such  that  the  polygon  edge  counterclockwise  of  p  is  visible  &om  si,  find 
the  first  vertex  of  SPM (s)  or  SPM (t)  counterclockwise  of  p  along  the  edge.  Let  q 
be  the  first  vertex  of  SPM  (a)  counterclockwise  of  p;  it  is  part  of  the  subdivision. 
We  need  to  determine  whether  any  vertices  of  SPM(t)  lie  in  the  interior  of  edge  pq. 

We  begin  by  characterizing  vertices  of  SPM{t)  that  lie  between  p  and  q,  assuming 
that  at  lezist  one  exists.  Let  v  be  such  a  vertex,  chosen  closest  to  p  if  there  are  several 
such  vertices.  Vertex  u  is  an  endpoint  of  an  extension  edge  e.  The  edge  separates 
two  regions  of  SPM{t);  v  lies  on  the  base  of  both.  The  apexes  of  the  two  regions  lie 
on  the  line  containing  c.  Because  v  is  not  a  polygon  vertex,  the  two  apexes  cannot 
be  coincident. 
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First  consider  the  case  in  which  the  apex  to  the  right  of  e  is  closer  to  v  than  the 
apex  to  the  left,  as  in  Figure  5.3(a).  (We  define  right  and  left  by  assuming  that  e 
is  directed  away  from  t.)  Because  p  lies  on  the  boundziry  of  the  region  on  the  right, 
the  last  two  vertices  of  lie  on  the  segment  ab  of  Figure  5.3(a).  (In  degenerate 

cases  several  polygon  vertices  may  lie  on  ab.)  Hence  v  can  be  determined  as  the 
intersection  of  the  ray  through  the  last  two  vertices  on  ir{t,p)  with  the  edge  pq. 


(a)  (b) 


Figure  5.3.  The  point  t;  is  a  vertex  of  SPM(t)  that  lies  between  p  and  q, 
the  vertex  of  SPM{s)  that  follows  p  on  the  boundary  of  P.  Figures  (a) 
and  (b)  show  the  two  configurations  of  SPM(t)  that  give  rise  to  such  a 
vertex. 

Now  consider  the  case  in  which  the  apex  to  the  left  of  e  is  closer  to  v  than  the 
apex  to  the  right,  as  in  Figme  5.3(b).  Because  7r(s,i;)  cannot  lie  to  the  left  of  the 
polygon  vertex  b  or  to  the  right  of  Tr{t,v),  it  must  go  through  b.  Thus  b  is  l{s,v). 
The  region  of  SPM{t)  to  the  right  of  e  has  a  as  its  apex;  that  is,  l{t,  p)  =  a.  Because 
there  axe  no  vertices  of  SPM(s)  between  p  and  q,  the  last  vertex  on  7r(s,p),  l{s,p), 
is  either  equal  to  6  or  to  a  polygon  vertex  in  the  interior  of  ab.  (Because  pq  is  visible 
from  si,  l{s,p)  cannot  be  a.  It  is  not  equal  to  b  only  in  the  case  of  degeneracies.) 
Hence  v  can  be  determined  as  the  intersection  of  the  ray  through  l{t,p)  and  l{s,p) 
with  the  edge  pq. 

To  detect  vertices  of  SPM{t)  on  the  edge  pq,  we  need  to  compute  only  two 
intersections.  If  the  ray  through  the  last  two  vertices  on  7r(<,p)  intersects  or 
the  ray  through  l{t,p)  and  l{s,p)  intersects  pq,  then  we  move  p  to  the  intersection 
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farthest  clockwise;  it  is  a  vertex  of  SPM(t).  Otherwise  we  move  p  to  q.  (If  l(t,p)  =  t, 
the  first  ray  is  undefined  and  its  intersection  does  not  exist.  If  either  ray  hits  pq, 
the  ray  farther  clockwise  hits  pq  without  hitting  any  other  part  of  P  first;  this  is 
because  the  quadrilateral  determined  by  p,  g,  l(s,p),  and  l(t,p)  is  free  of  polygon 
points.)  See  Figure  5.4  for  an  example  of  these  intersections. 


(a) 


(b) 


Figure  5.4.  The  circled  points  are  vertices  of  SPM{t)  that  do  not  appear 
in  the  subdivision  left  of  7r(s,  p).  They  are  detected  when  the  algorithm 
tries  to  move  p  along  the  polygon  boundary  to  q.  (Vertices  I  and  r  are  the 
last  polygon  vertices  on  Jr(s,p)  and  7r(t,p),  respectively.  Vertex  u  is  the 
predecessor  of  r  on  ir(t,  p).)  In  (a),  the  circled  vertex  is  the  intersection 
of  pq  with  the  extension  of  Sr.  Vertex  r  is  removed  from  ir(t,p).  In 
(b),  p  cannot  move  past  the  intersection  of  rl  with  pq  (circled)  without 
violating  the  inward  convexity  of  ir(t,p). 

We  have  described  how  the  algorithm  moves  p  along  an  edge  visible  from  si; 
we  now  describe  how  it  moves  p  along  an  extension  edge.  Extension  edges  require 
care  because  they  may  have  multiple  polygon  vertices  along  them,  even  though  no 
shortest  path  map  edges  need  to  be  changed.  This  degenerate  case  is  depicted  in 
Figture  5.5.  The  algorithm  handles  extension  edges  specially  to  avoid  having  to 
visit  many  vertices  when  changing  only  a  few  shortest  path  map  edges.  It  uses 
the  base  pointer  stored  at  the  apex  of  each  region.  The  apex  is  the  last  vertex 
on  7r(s,p)  or  7r(<,p);  the  algorithm  follows  its  pointer  to  skip  over  extension  edges, 
moving  directly  from  the  first  vertex  to  the  base,  or  vice  versa.  In  Figure  5.5,  p 
moves  directly  from  u  to  u  and  from  w  to  x.  If  there  are  intermediate  vertices  on 
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the  extension  edge  (which  can  be  checked  in  constant  time),  the  edges  along  the 
extension  axe  the  same  in  both  SPM(s)  and  SPM(t).  If  there  are  no  intermediate 
vertices,  the  edge  is  added  or  deleted  in  constant  time. 


u 


Figure  5.5.  In  degenerate  cases,  multiple  polygon  vertices  may  lie  on 
one  extension  edge  on  the  boimdaxy  of  Vis(st).  None  of  the  shortest 
path  map  edges  along  the  extension  edge  needs  to  be  changed,  so  the 
algorithm  skips  over  them. 

The  shortest  paths  ?r(s,p)  and  ir(t,p)  are  not  hard  to  maintain.  They  change 
only  when  p  passes  through  a  vertex  of  SPM{s)  or  SPM(t).  When  p  reaches  a 
vertex  of  SPM(t)  of  the  type  shown  in  Figure  5.4(a),  then  r,  the  last  vertex  of 
7r(<,p),  is  popped  off  the  top  of  its  stack.  When  p  advances  from  v  along  edge  vq 
emd  q  is  to  the  right  of  fu,  then  v  is  pushed  onto  the  stack  for  7r(t,p).  Changes  to 
the  stack  for  7r(s,p)  axe  made  analogously.  The  two  shortest  paths  change  only  in 
these  four  situations. 

The  algorithm  runs  in  time  proportional  to  the  niunber  of  vertices  p  visits.  The 
only  vertices  p  visits  where  the  algorithm  does  not  modify  the  current  subdivision 
are  vertices  v  where  l{s,v)  =  l{t,v)  and  the  edge  from  v  to  l{s,v)  belongs  to  both 
SPM(s)  and  SPM(t).  Such  a  vertex  is  one  endpoint  of  an  extension  edge  on  the 
boundary  of  Fw(st);  it  is  a  “shadow”  of  a  vertex  visible  from  st.  The  algorithm 
changes  the  subdivision  at  the  vertex  of  which  v  is  a  shadow.  Since  each  vertex 
where  the  algorithm  does  change  the  subdivision  has  at  most  one  such  shadow 
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vertex,  we  have  the  following  theorem. 

Theorem  5.1.  The  sweeping-path  algorithm  computes  SPM(t)  from  SPM(s)  in 
time  proportional  to  the  number  of  differences  between  the  two  shortest  path  maps. 

5.2.2  A  depth-first  search  algorithm 

Our  second  transformation  algorithm  is  a  depth-first  search  algorithm;  it  is  based 
on  the  shortest  path  map  construction  of  Chapter  4.  Because  it  uses  finger  search 
trees  [GMPR77,HM82],  it  may  be  harder  to  implement  than  the  sweeping-path 
algorithm.  The  idea  behind  the  depth-first  search  algorithm  is  not  diffictilt:  the  al¬ 
gorithm  of  Section  4.1,  which  runs  in  a  triangulated  simple  polygon,  can  be  modified 
to  rvm  using  a  shortest  path  map  in  place  of  the  triangulation.  We  build  SPM{i) 
by  running  the  modified  algorithm  in  SPM (s).  The  algorithm  changes  only  the  re¬ 
gions  overlapping  it  runs  in  time  proportional  to  the  nmnber  of  differences 

between  SPM(s)  and  SPM{t). 

The  algorithm  of  Chapter  4  is  a  depth-first  exploration  of  a  trieuigulation.  It 
splits  the  current  funnel  each  time  it  enters  a  new  triangle;  when  it  finishes,  it  has 
computed  a  ftmnel  for  each  polygon  edge.  Extending  the  edges  on  the  funnel  sides 
gives  the  shortest  path  map  decomposition  of  the  polygon. 

Because  the  polygon  is  subdivided  into  triangles,  the  algorithm  splits  a  funnel 
in  two  when  it  enters  a  region.  However,  funnel-splitting  works  equally  well  if  the 
polygon  is  subdivided  into  convex  polygons  with  more  than  three  sides.  When  the 
algorithm  crosses  a  diagonal  to  enter  a  convex  subpolygon  with  k  sides,  it  splits  the 
diagonal’s  funnel  into  A;  —  1  subfunnels. 

Each  region  of  a  shortest  path  map  is  convex,  so  we  can  build  SPM{t)  by 
running  the  algorithm  of  Chapter  4  in  SPM(s)  instead  of  in  a  triangulation.  Because 
SPM{s)  has  vertices  that  do  not  belong  to  P,  the  resulting  shortest  path  map 
has  extra  edges  that  we  then  have  to  delete.  A  second  potential  problem  arises 
because  regions  of  SPM(s)  may  be  degenerate.  A  region  may  have  arbitrarily 
many  edges  on  its  botmdary,  even  though  the  edges  lie  on  only  three  lines.  However, 
the  exploration  strategy  outlined  below  ensures  that  degeneracies  do  not  hurt  the 
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algorithm’s  performance. 

Our  algorithm  runs  in  time  proportional  to  the  number  of  differences  between 
SPM {3)  -axid  SPM{t).  To  do  this,  it  explores  only  the  regions  of  SPM(s)  that 
overlap  It  determines  which  regions  to  explore  at  nm  time.  This  is  feasible 

because  the  algorithm  of  Section  4.1  is  incremental:  at  each  step  the  data  structures 
contain  the  correct  shortest  path  tree  for  the  subpolygon  explored  so  far. 

The  algorithm  begins  by  exploring  the  region  incident  to  st.  In  the  rest  of 
the  polygon,  it  crosses  only  diagonals  that  lie  inside  We  show  that  these 

diagonals  are  exactly  those  with  nontrivial  funnels  in  SPT(t),  the  shortest  path  tree 
of  t.  Let  ^  be  a  diagonal  of  SPM{3),  and  without  loss  of  generality  assume  that 
l{s,  b)  =  a.  Then  ab  lies  inside  Vw(5t)  if  and  only  if  l(t,  b)  ^  a,  that  is,  the  funnel 
associated  with  ab  in  SPM(t)  does  not  have  a  as  its  apex.  Because  b  is  the  funnel 
apex  only  for  ab  =  si,  edge  ab  lies  inside  Vi3(^  if  and  only  if  its  funnel  in  SPT(t) 
is  nontrivial. 

When  the  algorithm  enters  a  region  by  crossing  an  edge  e,  it  can  determine  in 
constant  time  whether  any  particular  edge  of  the  region  has  a  trivial  funnel  (equal 
to  the  edge  itself).  An  edge  has  a  trivial  funnel  if  and  only  if  the  tangents  from 
its  endpoints  to  the  interior  of  e’s  funnel  lie  on  the  same  line.  In  such  a  case  the 
tangents  both  pass  through  the  same  endpoint  of  e. 

The  algorithm  cannot  afford  to  look  at  all  the  diagonals  that  it  does  not  cross. 
Fortunately,  it  can  recognize  edges  with  trivial  funnels  without  looking  at  them 
individually.  The  algorithm  enters  each  region  by  crossing  an  edge  c  incident  to  the 
apex  of  the  region.  (This  is  because  s  and  t  are  neighbors  on  P.)  If  there  are  any 
other  edges  on  the  side  of  the  region  that  contains  e,  their  associated  funnels  must 
be  trivial.  (See  Figure  5.6.)  Because  the  apex  of  each  region  in  the  subdivision 
has  a  pointer  to  the  base  of  the  region,  we  can  check  in  constant  time  whether  the 
base’s  funnel  is  trivial.  If  any  edge  on  the  side  of  the  region  opposite  c  has  a  trivial 
funnel,  all  such  edges  do.  We  can  classify  them  all  at  once  by  checking  the  edge 
incident  to  the  apex  of  the  region.  (In  Figure  5.6,  only  the  base  has  a  nontrivial 
funnel.) 

The  algorithm  constructs  SPM (t)  inside  the  region  without  looking  at  the  edges 
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Figure  5.6.  The  algorithm  enters  the  shaded  region  by  crossing  edge  e. 

The  funnels  on  both  sides  of  the  shaded  region  are  trivial. 

with  trivial  funnels.  It  sphts  up  the  funnel  that  belongs  to  e  among  the  other  edges. 
(This  splitting  is  no  more  difficult  if  several  of  the  edges  with  nontrivial  funnels 
lie  on  a  single  side  of  the  region.)  If  any  of  the  edges  with  nontrivial  funnels  lies 
on  the  boundary  of  P,  the  algorithm  divides  its  funnel  up  into  regions  of  SPM{t) 
and  sets  the  base  pointers  at  the  apex  of  each  region.  If  multiple  edges  with  trivial 
funnels  lie  on  a  single  side  of  a  region  of  SPM{s),  then  those  edges  also  belong  to 
SPM(t).  If  only  one  such  edge  lies  on  a  single  side,  it  may  not  belong  to  SPM{t). 
The  cleanup  phase  described  below  deletes  it  if  necessary. 

When  the  algorithm  finishes  its  exploration,  it  has  constructed  a  refinement  of 
SPM{t)  in  a  contiguous  set  of  regions  of  SPM{s).  This  subpolygon  of  P  may  have 
some  vertices  that  do  not  belong  to  P,  and  hence  the  new  shortest  path  map  may 
have  some  vertices  and  edges  that  do  not  belong  to  SPM(t).  The  additional  vertices 
are  vertices  of  SPM(s)  that  split  polygon  edges;  they  lie  on  the  bases  of  regions  of 
SPM{s).  The  additional  edges  axe  incident  to  the  extraneous  vertices.  We  clean  up 
the  shortest  path  map  by  visiting  both  endpoints  of  the  base  of  each  explored  region 
of  SPM{s).  If  the  diagonal  edge  incident  to  such  a  vertex  separates  two  adjacent 
triangular  regions  of  (the  computed)  SPM{i)  that  share  the  same  apex  and  whose 
beises  lie  on  the  same  polygon  edge,  then  the  edge  does  not  belong  in  SP.yI{t);  we 
delete  the  edge,  merge  the  two  regions  into  one,  and  set  the  base  pointer  at  the 
apex  of  the  region.  This  cleanup  operation  removes  all  the  edges  that  do  not  belong 
to  5PAf(<),  both  inside  the  changed  subpolygon  and  on  its  boundary.  It  leaves 
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unchanged  those  edges  that  belong  to  both  SPM{s)  and  SPM{t).  It  runs  in  time 
proportional  to  the  number  of  regions  of  SPM (s)  explored,  since  it  looks  at  at  most 
two  edges  in  each  such  region. 

After  the  cleanup  operation,  the  subpolygon  that  has  been  changed  contains 
SPM{t)\  the  unchanged  part  of  the  polygon  contains  SPM(s)  (and  hence  SPM{t)). 
This  concludes  the  transformation.  We  have  computed  SPM{t),  and  the  new  sub¬ 
division  is  ready  to  be  transformed  again. 

Like  the  algorithm  of  Chapter  4,  this  algorithm  runs  in  time  proportional  to 
the  total  number  of  funnel  edges  it  creates.  Each  funnel-splitting  operation  creates 
one  new  funnel  edge.  The  number  of  funnel-splittings  performed  inside  a  region 
of  SPM(s)  is  proportional  to  the  nvunber  of  edges  with  nontrivial  funnels  on  the 
boundary  of  the  region.  Since  P  does  not  have  consecutive  coUinear  edges,  this 
quantity  is  in  turn  proportional  to  the  munber  of  regions  adjacent  to  the  current  one 
that  are  explored  after  it.  Thus  the  total  number  of  funnel-splittings  is  proportional 
to  the  munber  of  regions  of  SPM (s)  the  sdgorithm  explores. 

The  depth-first  exploration  and  the  cleanup  pheise  both  take  time  proportional 
to  the  number  of  regions  of  SPM(s)  the  algorithm  explores.  Each  region  except  the 
first  is  reached  by  deleting  an  edge  of  SPM{s).  Since  the  number  of  edges  deleted 
is  just  part  of  the  total  number  of  edges  changed,  we  have  the  following  theorem. 

Theorem  5.2.  The  depth-£rst  search  algorithm  computes  SPM(t)  from  SPM{s) 
in  time  proportional  to  the  number  of  differences  between  them. 


5.3  Complexity  bounds 

This  section  shows  that  each  of  the  algorithms  of  the  preceding  section  takes  0(m) 
time  overall.  The  proof  associates  a  constant  number  of  shortest  path  map  edges 
with  each  visibility  edge  or  polygon  edge;  every  edge  that  appears  in  a  shortest  path 
map  has  an  associated  visibility  edge  or  polygon  edge.  As  the  source  of  the  shortest 
path  map  moves  around  the  boundary  of  P,  edges  axe  added  to  and  deleted  from 
the  current  shortest  path  map.  We  show  that  each  edge  is  added  and  deleted  a 
constant  number  of  times,  which  gives  the  desired  bound  on  the  running  time. 
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It  is  helpful  to  think  of  shortest  path  edges  as  being  directed.  The  edge  pq  has 
two  versions,  pq  and  gp.  When  pq  appears  in  shortest  path  map  SPM{v),  we  write 
it  as  pg  if  p  lies  on  the  path  from  v  to  q  and  ^  otherwise. 

We  associate  every  edge  that  appears  in  a  shortest  path  map  with  a  directed 
polygon  edge  or  visibility  edge.  Shortest  paths  follow  visibility  edges  and  polygon 
edges,  so  we  only  need  to  account  for  the  edges  of  the  shortest  path  map  that  axe 
not  in  the  shortest  path  tree.  These  edges  are  extensions  of  funnel  edges.  Each 
funnel  edge  is  a  polygon  edge  or  a  visibility  edge;  we  associate  each  extension  edge 
with  the  version  of  its  fvumel  edge  that  is  directed  toward  it.  Each  directed  edge 
has  at  most  one  extension. 

The  following  lemma  characterizes  the  shortest  path  maps  in  which  each  directed 
edge  appezurs. 

Lemma  5.3.  Let  pq  be  a  polygon  edge  or  a  visibility  edge.  The  shortest  path 
maps  in  which  the  directed  edge  pq  appears  have  sources  that  form  a  contiguous 
subsequence  of  the  polygon  vertices. 

Proof:  We  exhibit  polygon  vertices  /  and  r  sudi  that  pq  appears  in 
SPM{v)  for  all  v  strictly  counterclockwise  of  I  and  clockwise  of  r  in 
the  sequence  of  polygon  vertices,  and  pq  appears  in  no  other  shortest 
path  maps.  We  first  note  that  pq  must  appear  in  SPM{p).  Let  /  be 
the  first  vertex  clockwise  of  p  for  which  ^  is  not  in  SPM{1).  Because 
shortest  paths  do  not  cross,  pq  cannot  appear  in  any  shortest  path  map 
whose  source  is  between  I  and  g,  inclusive.  A  similar  argument  shows 
the  existence  of  r  and  proves  the  lemma.  | 

Now  consider  the  shortest  path  map  transformations  of  the  previous  section. 
If  s?  is  an  edge  of  P,  the  shortest  path  map  SPM(t)  cein  be  constructed  from 
SPM(s)  in  time  proportional  to  the  number  of  edges  that  must  be  changed  to  get 
from  one  to  the  other.  The  preceding  lemma  lets  us  boimd  the  time  needed  to 
produce  shortest  path  maps  for  all  the  polygon  vertices. 

Theorem  5.3.  Given  SPM{v\)  for  any  polygon  vertex  vi,  repeated  application 
of  either  of  the  algorithms  of  Section  5.2  produces  a  shortest  path  map  for  every 
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polygon  vertex  in  0(m)  total  time.  Therefore,  we  can  build  the  visibility  graph 
of  P  in  0(m  +  n  log  log  n)  time. 

Proof:  As  the  source  of  the  shortest  path  map  moves  around  the 
perimeter  of  the  polygon,  each  directed  polygon  or  visibility  edge  is 
added  to  or  deleted  from  the  current  shortest  path  map  at  most  twice. 

(This  follows  from  Lemma  5.3.)  The  extension  edge  associated  with 
the  directed  edge,  if  any,  is  added  and  deleted  along  with  it.  There  are 
2n+2m  directed  edges,  and  hence  by  either  Theorem  5.1  or  Theorem  5.2 
the  n  —  1  applications  of  the  transformation  procedure  take  0(m)  time 
altogether.  | 

5.4  Conclusion  and  open  problems 

This  chapter  has  presented  an  algorithm  to  find  the  visibility  graph  of  a  simple 
polygon  in  time  0{m  +  n  log  log  n),  where  m  is  the  munber  of  edges  in  the  graph. 
The  algorithm  works  by  computing  the  shortest  path  map  from  each  vertex  of  the 
polygon,  then  reading  off  the  visibility  edges  incident  to  the  source  of  the  shortest 
path  map.  Rather  than  compute  each  shortest  path  map  separately,  the  algorithm 
finds  one  shortest  path  map  in  0(n  log  log  n)  time,  then  builds  the  others  as  modi¬ 
fications  of  the  first  one.  In  particular,  the  shortest  path  map  with  source  t  is  built 
by  modifying  the  one  whose  source  is  the  clockwise  neighbor  of  t  along  the  polygon 
boundary.  This  procedure  is  fast  because  of  the  similarity  between  two  shortest 
path  maps  whose  sources  are  neighbors  on  the  polygon. 

Because  it  uses  finger  search  trees  to  find  the  initial  shortest  path  map,  the 
algorithm  may  be  difficult  to  implement.  We  czin  avoid  the  use  of  finger  search 
trees  (at  the  cost  of  slightly  increased  asymptotic  complexity)  by  finding  the  initial 
shortest  path  map  SPM  (ui)  in  0(n  log  n)  time.  We  triangulate  P  with  the  algorithm 
of  Garey  et  al.  [GJPT78].  By  replacing  the  finger  search  trees  in  the  eJgorithm  of 
Section  4.1  with  linked  lists,  we  find  the  shortest  path  map  in  0{n  log  n)  additional 
time.  We  use  the  sweeping-path  algorithm  to  transform  shortest  path  maps  into 
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each  other.  This  yields  an  0(m  +  n  log  n)  algorithm  that  uses  only  simple  data 
structures. 

The  algorithms  of  Section  5.2  can  be  simplified  if  we  assume  that  P  is  non¬ 
degenerate.  If  no  three  points  of  P  are  collineax,  then  any  triangular  shortest  path 
map  region  has  at  most  five  vertices  on  its  boundary.  In  this  case  the  sweeping 
path  algorithm  can  afford  to  visit  all  the  vertices  on  the  boundary  of  Vi3(st):  the 
algorithm  adds  or  deletes  an  edge  at  every  such  vertex.  Similarly,  the  depth-first 
search  algorithm  can  afford  to  check  every  edge’s  funnel  explicitly.  The  number  of 
edges  on  the  boundary  of  the  explored  subpolygon  is  proportional  to  the  number 
of  regions  of  SPM(s)  that  make  it  up. 

Even  if  degeneracies  are  allowed,  alternate  definitions  of  visibility  can  simplify 
our  algorithms.  If  we  define  two  points  to  be  mutually  visible  when  the  segment 
connecting  them  does  not  intersect  the  exterior  of  P,  the  size  of  the  visibility  graph 
may  increase.  In  a  polygon  with  many  collinear  points,  we  may  have  m  = 
under  this  definition  and  m  =  0(n)  imder  the  standard  definition.  Under  this 
broader  definition  of  visibility,  the  sweeping-path  algorithm  will  nm  in  0(m)  time 
without  using  special  cases  to  move  p  past  the  vertices  on  extension  edges. 

The  presence  of  the  0(n  log  log  n)  term  in  om  algorithm’s  nmning  time  suggests 
an  open  question:  Is  it  possible  to  triangulate  a  simple  polygon  in  0(m)  time?  If 
it  were,  we  would  have  an  optimal  visibility  graph  algorithm  for  simple  polygons. 
(There  are  polygons  with  m  =  0(n)  for  which  the  triangulation  algorithm  of  Tarjan 
and  Van  Wyk  takes  f2(nloglogn)  time.)  This  question  fits  into  a  framework  of 
earlier  work.  Several  authors  have  proposed  triangulation  algorithms  that  run  in 
time  0(n  log  fc)  for  some  parameter  k  related  to  the  complexity  of  the  polygon. 
For  example,  in  the  algorithm  of  Hertel  and  Mehlhom,  k  is  the  number  of  refiex 
angles  in  the  polygon  [HM83];  in  that  of  Chazelle  and  Incerpi,  k  is  the  “sinuosity” 
of  the  polygon  [CI84].  In  some  sense,  we  ask  less  of  our  triangulation  algorithm 
than  these  authors.  We  want  it  to  run  in  0(m),  but  this  can  be  much  larger  than 
O(nloglogn).  When  m  is  smaller  than  0(n  log  log  n),  the  polygon  is  narrow  and 
few  triangulations  are  possible.  This  may  make  it  easier  to  find  one  of  them. 


Chapter  6 

Optimal  Shortest  Path  Queries  in 
a  Simple  Polygon 


How  many  miles  to  Babylon? 

Threescore  and  ten. 
Can  I  get  there  by  candle-light? 

Yes,  and  back  again. 

—  Nursery  rhyme 


This  chapter,  like  Chapter  4,  focusses  on  the  problem  of  finding  shortest  paths 
for  a  point  moving  inside  a  simple  polygon  with  n  vertices.  Several  algorithms 
have  been  proposed  to  solve  this  problem.  All  the  methods  (including  the  one 
presented  here)  are  based  on  a  triangulation  of  the  polygon.  The  algorithm  of  Lee 
and  Preparata  finds  the  shortest  path  between  two  points  inside  a  simple  polygon 
in  linear  time,  once  a  triangulation  is  known  [LP84].  Reif  and  Storer’s  [RS85] 
approach  uses  precomputation  to  speed  up  queries.  Given  a  source  point  inside  the 
polygon,  their  method  produces  a  search  structure  so  that  the  distance  from  a  query 
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point  to  the  source  can  be  found  in  O(log  n)  time.  The  shortest  path  itself  can  be 
obtained  in  time  proportional  to  the  number  of  turns  along  it.  Reif  and  Storer’s 
method  iises  the  Delaunay  triangulation  of  the  polygon  and  hence  talces  0(n  log  n) 
preprocessing  time.  Chapter  4  shows  how  to  set  up  a  similar  query  structure  with 
less  preprocessing.  That  algorithm  talces  linear  time  once  a  triangulation  is  known. 

The  sub-linear  query  algorithms  mentioned  above  pre-select  a  fixed  source  point. 
If  neither  endpoint  of  the  path  is  predetermined,  each  of  these  algorithms  takes 
linear  time  to  find  the  shortest  path  (or  even  its  length).  The  present  chapter 
removes  the  single-source  restriction;  it  gives  a  data  structure  that  supports  sub- 
linear  shortest  path  queries  when  both  endpoints  are  part  of  the  query.  The  data 
structure  can  be  built  in  linear  time  once  the  polygon  has  been  triangtdated.  After 
the  data  structure  hais  been  built,  the  query  algorithm  can  find  the  length  of  the 
shortest  path  between  two  arbitrary  points  inside  the  polygon  in  logarithmic  time. 
The  path  itself  can  be  obtained  in  additional  time  proportional  to  the  number  of 
tiims  along  it.  These  bounds  are  clearly  best  possible. 

The  idea  imderlying  our  method  is  not  difficult.  The  preprocessing  phase  creates 
a  hierarchy  of  nested  subpolygons  over  an  underlying  triangulation,  such  that  any 
shortest  path  crosses  only  a  small  number  of  subpolygons.  We  store  information 
about  shortest  paths  inside  each  such  subpolygon.  At  query  time,  we  obtain  the 
shortest  path  between  the  query  points  by  assembling  path  information  from  the 
subpolygons  between  them. 

Section  6.1  describes  the  balanced  hierarchical  decomposition  of  P  that  we  will 
use.  (Appendix  A  tells  how  to  construct  it.)  Section  6.2  characterizes  shortest 
paths  in  terms  of  this  decomposition.  Section  6.3  describes  the  “funnels”  £ind 
“hourglasses”  associated  with  shortest  paths,  as  well  as  the  data  structures  for 
representing  them.  These  structures  allow  linear  preprocessing  and  O(log*  n)  query 
time.  Finally,  Section  6.4  shows  how  to  use  an  additional  0{n)  preprocessing  time  to 
enhance  the  structures  of  the  previous  section,  thus  yielding  the  desired  logarithmic 
query  time.  Section  6.5  discusses  various  extensions  and  applications. 
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6.1  Balanced  hierarchical  decomposition  of  P 

This  section  introduces  a  method,  due  to  Chazelle  and  Guibas  [CG85],  of  splitting 
the  polygon  P  into  subpolygons.  The  important  property  of  the  decomposition 
is  that  the  shortest  path  in  P  between  any  pair  of  points  passes  through  only  a 
logarithmic  number  of  subpolygons.  The  shortest  path  can  then  be  derived  from  the 
sequence  of  subpolygons  by  referring  to  pre-computed  information  about  shortest 
paths  inside  each  subpolygon. 

The  polygon  cutting  theorem  of  Chazelle  states  that  any  simple  polygon  of  at 
least  four  vertices  has  a  diagonal  that  divides  it  into  two  subpolygons  of  roughly 
equal  size  [Cha82].  Specifically,  if  the  polygon  P  has  n  sides,  each  of  the  two 
subpolygons  Pi  and  Pa  produced  by  splitting  has  at  least  n/3  + 1  sides  and  at  most 
2n/3  +  1:  the  larger  subpolygon  is  less  than  twice  as  big  as  the  smaller.  If  we 
apply  the  theorem  recursively  to  split  each  of  the  subpolygons,  we  get  a  balanced, 
hierarchical  decomposition  of  P  into  triangles.  These  triangles  form  a  triangulation 
of  P. 

The  decomposition  is  easier  to  work  with  if  we  interpret  it  as  a  binary  tree  S 
whose  nodes  correspond  to  the  splitting  diagonals  of  P.  At  the  root  of  the  tree  is  the 
diagonal  that  splits  P  into  Pi  and  P2.  The  children  of  the  root  are  the  roots  of  the 
(recursively  defined)  decomposition  trees  corresponding  to  Pi  and  Pj.  We  can  think 
of  the  decomposition  as  proceeding  in  stages:  at  time  0  the  root  diagonal  splits  P 
into  two  balanced  subpolygons;  the  children  of  the  root  split  these  subpolygons  at 
time  1,  and  so  on.  The  time  at  which  a  particular  diagonal  is  introduced  corresponds 
to  its  depth  in  the  tree  S.  We  call  the  subpolygons  produced  during  this  process 
cells.  These  cells  are  the  polygons  for  which  we  pre-compute  and  store  shortest-path 
information.  They  are  fundamental  to  om  approach  to  the  shortest  path  problem, 
and  it  is  worth  taking  time  to  understand  them. 

For  ease  of  reference,  we  associate  with  each  diagonal  d  the  cell  it  splits  in  two; 
we  call  the  cell  Pd.  The  diagonal  d  cuts  Pd  into  two  other  cells;  by  the  definition  of  5, 
the  children  of  d  in  5  are  the  diagonals  associated  with  those  cells.  The  depth  of  a 
cell  Pd  is  equal  to  the  depth  of  d  in  S.  Triangles  of  the  imderlying  triangulation  are 
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cells  not  associated  with  a  diagonal;  we  assign  such  a  triangle  a  depth  one  greater 
than  the  depth  of  its  deepest  bounding  diagonal.  Because  the  decomposition  is 
balanced;  the  depth  of  a  diagonal  d  in  5  is  related  to  the  size  of  Pj.  Let  the  depth 
of  d  be  i{d)  (the  depth  of  the  root  is  0).  Then  it  follows  from  the  splitting  boimds 
that  Pd  has  no  more  than  0((2/3)^(‘*^n)  vertices  on  its  boundziry.  This  fact  implies 
that  the  tree  S  has  height  logarithmic  in  n. 

The  edges  bounding  a  cell  are  of  two  kinds,  edges  of  P  and  diagonals  of  P. 
Polygon  edges  act  as  walls:  shortest  paths  in  Pd  cannot  cross  them.  Diagonals 
act  as  doors:  shortest  paths  that  pass  through  Pd  cross  diagonals  to  enter  and 
leave.  As  it  happens,  a  cell  has  relatively  few  bounding  diagonals.  The  diagonals 
bounding  Pd  must  all  be  of  different  depths  less  than  i(d),  as  follows  easily  by 
induction.  Therefore  Pd  has  at  most  £{d)  =  0(log  n)  diagonals  on  its  boimdary. 

Shortest  paths  that  pass  through  a  cell  enter  and  leave  by  a  particular  pair 
of  diagonals.  To  systematically  refer  to  these  pairs,  we  define  a  graph  5*,  called 
the  factor  graph,  based  on  the  decomposition  tree  S.  For  each  cell  Pd,  S  has  an 
edge  connecting  d  to  its  parent,  the  deepest  diagonal  bounding  Pj.  We  get  5* 
from  S  by  adding  edges  connecting  d  not  only  to  its  parent,  but  to  all  the  diagonals 
bounding  P,*.  By  construction,  S*  contains  5  as  a  subgraph.  Each  diagonal  d  has 
at  most  £(d)  edges  in  the  factor  graph  connecting  it  to  diagonals  of  lesser  depth, 
one  to  each  boimding  diagonal  of  Pd.  Furthermore,  since  a  diagonal  d  is  adjacent 
to  only  two  cells  at  each  stage  of  the  decomposition,  d  has  edges  to  at  most  two 
diagonals  of  each  depth  greater  than  £(d).  These  observations  give  a  logarithmic 
degree  bound  for  the  nodes  of  S*. 

Each  edge  of  5*  represents  a  pair  of  diagonals  for  which  we  store  information 
about  shortest  paths  crossing  the  diagonals.  To  have  any  hope  of  storing  the  in¬ 
formation  in  a  linear-size  structure,  we  must  show  that  the  factor  graph  has  only 
linearly  many  edges. 

Lemma  6.1.  The  size  of  the  factor  graph  5*  is  0(n). 

Proof;  For  a  node  d  in  5,  the  height  of  d,  denoted  by  h(d),  is  the  length 
of  the  longest  path  to  a  leaf  below  d.  If  e  denotes  this  deepest  leaf,  then 
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the  height  of  d  is  £(e)  —  £(d).  Because  S  is  balanced,  there  cannot  be 
many  diagonals  with  large  height.  In  particular,  if  h{d)  is  k,  then  it  is 
easy  to  prove  inductively  that  Pd  must  contain  at  least  |^(3/2)*'“^  +  ij 
triangles  of  the  underlying  triangulation.  The  cells  belonging  to  diago¬ 
nals  with  the  same  height  kin  S  are  all  disjoint;  it  follows  that  there  are 
only  0((2/3)*n)  diagonals  with  height  k  in  5.  The  graph  S*  has  n  —  3 
nodes  and  at  most  2h(d)  edges  to  descendants  &om  each  node  d  (one 
for  each  depth  on  each  side).  Overall  therefore,  5*  has  at  most 

2/i(d)  =  0(  =  0(n) 

d^S*  ^<l+log3/2»* 

edges,  which  was  to  be  proved.  | 

We  can  compute  the  decomposition  tree  S  and  its  extension  S*  in  the  time  it 
takes  to  triangulate  P  plus  linear  additional  time.  We  use  the  algorithm  of  Tarjan 
and  Van  Wyk  [TV86]  to  triangulate  the  polygon  in  0(n  log  log  n)  time,  then  find 
a  balanced  hierarchical  decomposition  using  the  linear-time  method  described  in 
Appendix  A.  From  the  decomposition  we  can  easily  get  both  S  and  S*.^ 


6.2  Shortest  paths  and  the  polygon  decomposi¬ 
tion 

This  section  analyzes  the  sequence  of  diagonals  crossed  by  a  shortest  path  in  P. 

It  defines  a  logarithmic-size  subsequence  of  those  diagonals  that  helps  determine 

the  shortest  path.  The  most  important  characteristic  of  the  subsequence  is  that 

every  pair  of  diagonals  adjacent  in  it  lies  on  the  boundary  of  some  cell  of  the 

decomposition,  so  the  pair  corresponds  to  an  edge  in  the  factor  graph.  We  can 

discover  this  subsequence  quickly,  once  we  know  the  triangles  containing  the  path 

^Note  that  the  decomposition  provided  by  the  polygon  cutting  theorem  is  rarely  unique;  different 
decompositions  result  from  different  choices  of  splitting  edges.  When  the  decomposition  is  unique, 
there  is  only  one  triangulation  of  the  polygon,  and  the  algorithm  of  Tarjan  and  Van  Wyk  finds  it. 
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endpoints.^  By  combining  the  pre-computed  information  stored  for  adjacent  pairs 
of  diagonaJs,  we  can  then  find  the  path  itself. 

For  the  remainder  of  this  section,  let  us  fix  our  attention  on  the  shortest  path 
connecting  two  particular  points  p  and  q  in  P.  The  triangles  containing  the  points 
p  and  q  in  the  underlying  triangulation  associated  with  S  determine  the  sequence  of 
diagonals  crossed  by  the  shortest  path  between  the  two  points.  These  diagonals  each 
split  P  into  two  parts,  one  containing  p  and  the  other  q.  We  call  these  diagonals 
separating  for  p  and  q;  the  shortest  path  from  p  to  g  crosses  only  these  diagonals, 
and  each  of  them  exactly  once.  In  fact,  the  shortest  path  between  the  points  must 
cross  these  separating  diagonals  in  a  particular  order.  When  the  shortest  path 
from  p  to  q  crosses  a  diagonal  d,  it  must  have  already  crossed  all  the  separating 
diagonals  on  the  side  of  d  nearer  p  and  none  of  those  on  the  side  nearer  q.  Note 
also  that  in  this  ordering  adjacent  separating  diagonals  must  have  different  depths, 
and  that  between  any  two  diagonals  of  the  same  depth  there  must  be  one  of  lesser 
depth  (the  “lacuna”  property).  These  facts  are  easy  to  establish  and  their  proofs 
are  omitted. 

We  now  define  a  subsequence  of  the  separating  diagonals  that  will  be  useful  in 
finding  the  shortest  path,  the  subsequence  of  the  so-called  principal  separating  diag¬ 
onals.  Which  are  the  principal  diagonals?  Consider  the  lowest  common  ancestor  d 
in  5  of  all  the  separating  diagonals.  Clearly  d  itself  is  a  separating  diagonal  and  its 
depth  is  minimal  among  all  separating  diagonals.  Indeed,  the  above  remarks  imply 
that  d  can  be  characterized  by  this  minimal  depth  property.  We  define  d  to  be  a 
principal  diagonal.  The  separating  diagonals  nearest  p  and  q  are  also  defined  to 
be  principal  diagonals;  we  call  them  dp  and  d,,  respectively.  (Note  that  d  can  be 
dp  or  dq,  or  both,  if  they  coincide.)  We  then  define  Dp  to  be  the  subsequence  of 
the  sepeurating  diagonals  between  dp  and  d  obtained  as  follows:  Scan  through  this 
sequence  firom  dp  to  d  and  keep  track  of  the  minimum  depth  seen  so  far.  During 
the  scan,  discard  all  diagonals  with  depth  greater  than  the  current  minimum.  The 
remedning  diagonals  are  of  strictly  increasing  depth  (by  the  lacuna  property)  and 
form  Dp.  The  sequence  Dq  is  defined  symmetrically,  proceeding  firom  d,  to  d.  The 


^The  triangles  can  be  found  using  any  standard  O(logn)  point-location  method,  such  as  [EGS86]. 


CHAPTER  6.  TWO-POINT  SHORTEST  PATH  QUERIES 


118 


principal  diagonals  are  then  just  those  in  Dp  together  with  those  in  Dg. 

The  strict  monotonicity  of  depths  in  Dp  has  two  important  consequences.  First, 
it  implies  that  there  are  only  logarithmically  many  principal  diagonals  in  Dp.  Sec¬ 
ond,  each  diagonal  in  Dp,  except  d,  is  contained  in  the  cell  split  by  its  successor 
in  Dp,  and  hence  is  a  descendant  in  S  of  that  successor.  Therefore  Dp  is  a  subse¬ 
quence  of  the  diagonals  on  the  path  in  S  from  dp  to  d.  Analogous  statements  hold 
for  Dg. 

This  observation  makes  it  easy  to  compute  Dp  in  logarithmic  time.  Instead  of 

A 

scanning  through  all  the  sepeirating  diagonals  between  dp  and  d  in  the  polygon  P,  we 
can  just  look  at  the  diagonals  on  the  path  from  dp  to  d  in  the  tree  5.  If  any  of  these 
diagonals  is  between  d  and  dp  in  P,  it  is  a  separating  diagonal  and  belongs  to  Dp. 
Otherwise,  it  is  not  separating  and  not  in  Dp.  The  test  of  whether  one  diagonal 
lies  between  two  other  diagonals  can  be  performed  in  constant  time.  Here  is  an 
outline  of  the  method:  Let  T  be  the  dual  tree  of  the  triangulation.  Preprocess  it 
by  assigning  to  each  node  t  (corresponding  to  a  triangle)  its  preorder  and  postorder 
numbers  Pre{t)  and  Post(t).  Then  a  node  s  is  the  ancestor  of  node  t  if  and  only  if 
both  Pre(s)  <  Pre(t)  and  Post(s)  >  Post(t).  Translate  the  query  about  diagonals 
of  P  (corresponding  to  edges  of  T)  into  one  about  nodes  of  T.  The  query  asks 
whether  node  s  lies  on  the  path  from  r  to  t.  The  answer  is  yes  if  and  only  if  one  of 
two  conditions  holds:  s  is  an  ancestor  of  exactly  one  of  r  and  t,  or  it  is  their  lowest 
common  ancestor. 

Because  all  the  separating  diagonals  between  a  pair  e  and  /  of  adjacent  principal 
diagonals  in  Dp  have  depth  greater  than  c  and  /,  the  principal  diagonals  c  and  / 
must  lie  on  the  boundary  of  a  single  cell  and  be  connected  by  an  edge  in  the  factor 
graph.  These  adjacent  principal  diagonals  represent  pairs  for  which  the  prepro¬ 
cessing  phase  of  our  algorithm  stores  a  representation  of  zJl  shortest  paths  between 
points  on  the  diagonals  in  the  pair.  In  conclusion,  we  know  that  in  O(log  n)  time 
we  can  extract  a  sequence  of  0(log  n)  cells  in  whose  “concatenation”  the  shortest 
path  from  p  to  g  must  lie. 
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6.3  Hourglasses  and  shortest  paths 

This  section  uses  an  enhanced  version  of  the  hourglasses  introduced  in  Section  4.3 
to  represent  shortest  paths  between  two  diagonals.  An  hourglass  is  associated  with 
each  edge  of  the  factor  graph;  by  combining  the  hoiurglasses  of  the  edges  that 
connect  principal  diagonals,  we  get  a  representation  of  all  shortest  paths  between 
dp  and  dg.  The  previous  sections  have  been  purposefully  vague  about  the  notion 
of  “combining”  hoiurglasses;  we  now  make  the  combining  process  more  concrete. 
However,  we  postpone  describing  the  data  structures  that  implement  hourglasses 
vmtil  Subsection  6.3.3. 

In  Chapter  4  we  used  funnels  to  represent  all  shortest  paths  from  a  source  vertex 
to  an  edge.  Because  the  source  vertex  in  our  problem  is  not  fixed,  we  need  richer 
structures  than  funnels.  Hourglasses  provide  the  additioned  flexibility  we  need.  Let 
AB  and  CD  be  two  diagonals  in  the  triangulation  of  P,  labelled  so  that  BACD  is 
a  subsequence  of  the  vertex  sequence  of  P.  As  in  Section  4.3,  we  define  the  union 
of  the  two  shortest  paths  7r(A,  C)  and  7r(P,  D)  to  be  the  hourglass  of  AB  and  CD. 
(Please  refer  to  Figxire  6.1.)  If  the  paths  share  any  vertices,  then  the  hourglass  is 
essentially  two  funnels  joined  by  a  polygonal  path  between  their  apexes.  We  call 
this  shortest  path  between  the  apexes  the  string  and  refer  to  the  hoiurglass  as  closed. 
If  the  paths  are  disjoint,  the  hotnglass  consists  of  two  inward  convex  chains.  In  this 
case,  AB  and  CD  are  mutually  visible;  there  is  a  segment  with  one  endpoint  on  AB 
and  the  other  on  CD  that  avoids  the  polygon.  We  refer  to  the  hourglass  as  open. 

Given  a  point  p  on  AB  and  a  point  g  on  CD,  we  can  use  the  hourglass  to  find 
the  shortest  path  from  p  to  q.  If  the  hoiurglass  is  closed,  the  problem  decomposes 
into  two  single-source  queries.  The  complete  shortest  path  is  just  the  concatenation 
of  the  path  from  p  to  the  apex  of  its  funnel,  the  string,  and  the  path  from  the  apex 
of  the  other  funnel  to  q.  As  noted  in  Section  4.1,  the  path  from  p  (or  q)  to  the 
apex  of  its  funnel  begins  with  the  tangent  from  the  point  to  the  funnel,  then  follows 
funnel  edges  to  the  apex.  When  the  hourglass  is  open,  the  situation  is  slightly  more 
complicated.  If  p  and  q  are  mutually  visible  (segment  pq  avoids  the  hourglass). 


then  pq  is  the  shortest  path.  Otherwise,  consider  the  lines  of  sight  between  AB 
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ajid  CD  (segments  that  avoid  the  hourglass  and  have  one  endpoint  on  each  of  AB 
and  CD).  Suppose  one  of  the  lines  of  sight  has  p  and  q  on  the  same  side,  say 
the  side  where  A  and  C  lie.  Then  the  shortest  path  includes  the  tangent  from  p 
to  ir(A,  C),  the  tangent  from  q  to  7r(i4,  C),  and  the  portion  of  7r(i4,  C)  connecting 
the  tangents.  See  Figure  6.2(a).  If  every  line  of  sight  separates  p  from  q,  the  shortest 
path  includes  an  inner  common  tangent  between  the  hourglass  chains.  In  this  case 
there  is  an  imblocked  tangent  from  p  to  one  of  the  hourglass  cheuns,  say  7r(  A,  C), 
but  not  to  the  other.  Similarly,  there  is  a  tangent  from  q  to  ir{B,  D).  The  shortest 
path  between  p  and  q  includes  these  tangents,  the  inner  common  tangent  between 
7r(i4,  C)  and  7r(B,  D)  that  gives  the  shorter  path,  and  the  portions  of  the  hourglass 
chains  needed  to  connect  the  tangents.  Please  refer  to  Figure  6.2(b)  for  an  example 
of  this  type  of  shortest  path. 

Hourgleisses  are  especially  well  suited  to  our  algorithm  because  they  axe  con- 
catenable.  That  is,  if  diagonal  ds  separates  di  from  da,  it  is  easy  to  compute  the 
hourglass  for  the  diagonal  pair  (di,  da)  from  the  hourglasses  for  (di,  da)  and  (da,  da). 
There  are  several  special  cases,  but  the  basic  idea  is  simple:  we  just  find  the  com¬ 
mon  tangents  of  the  convex  chains  that  make  up  the  two  hourglasses.  The  special 
cases  arise  because  the  concatenation  of  two  open  hourglasses  can  be  closed. 

case  A:  If  the  hourglasses  for  (di ,  da)  and  (da,  da)  are  both  closed,  so  is  that  for  (di,  da). 
It  retains  the  fuimels  for  di  and  da  imchanged,  but  omits  both  funnels  for  da. 
The  shortest  path  7r(a,  a')  between  the  apexes  of  the  two  funnels  for  da  consists 
of  the  common  tangent  between  the  two  funnels  and  the  frmnel  portions  that 
connect  it  to  the  apexes.  The  string  for  (di,d3)  incorporates  the  strings  for 
(di,da)  and  (da, da),  linking  them  with  ;r(a,a').  Please  see  Figure  6.3. 

case  B:  If  one  homrglass  is  closed,  say  that  for  (di,  da),  and  the  other  is  open,  the  result 
is  closed.  Suppose  a  is  the  apex  of  the  funnel  for  da  in  the  closed  hourglass. 
Constructing  the  shortest  paths  from  a  to  the  endpoints  of  da  requires  a 
constant  number  of  common  teingent  computations.  These  shortest  paths 
define  a  funnel  on  da  with  apex  a\  The  new  hourglass  uses  this  funnel,  the 
old  funnel  on  dj,  and  augments  the  old  string  with  the  shortest  path  7r(a,a'). 


Figure  6.3.  Concatenating  two  closed  hourglasses 
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See  Figure  6.4. 


Figure  6.4.  Concatenating  one  closed  and  one  open  hoxirglass 

case  C:  If  both  hourglasses  are  open,  their  concatenation  may  be  either  open  or  closed. 
The  four  common  tangents  that  touch  one  chain  from  each  hourglass  deter¬ 
mine  the  result.  If  both  outer  common  tangents  are  unblocked,  the  new 
hourglass  is  open,  as  in  Figure  6.5(a).  Otherwise  the  result  is  closed,  and  the 
points  of  contact  of  the  inner  common  tangents  determine  the  apexes  of  the 
fimnels.  Refer  to  Figure  6.5(b). 

We  have  not  yet  seen  how  to  use  hourglasses  to  find  shortest  paths  if  the  path 
endpoints  are  not  on  the  hourglass  diagonals.  However,  this  is  not  a  difficult  exten¬ 
sion.  Suppose  dp  and  d,  are  the  first  and  last  diagonals  in  the  sequence  of  separating 
diagonals  between  p  and  g,  and  that  we  are  given  the  hourglass  for  (dp,d,).  The 
shortest  path  is  determined  by  the  tangents  to  the  hourglass  that  pass  through  p 
and  q.  One  way  to  think  of  this  is  to  view  p  and  q  as  tiny  diagoneds;  then  the  trian¬ 
gle  defined  by  p  and  dp  is  the  hourglass  for  (p,  dp).  Concatenating  the  hourglasses 
for  (p,  dp),  {dp,  dg),  and  (d„  q)  gives  a  single  closed  hourglass  with  no  funnels,  only 
string.  The  string  is  the  shortest  path  between  p  and  q. 
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(b) 


Figure  6.5.  Concatenating  two  open  hourglasses:  (a)  open  result,  and 
(b)  closed  result 
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6.3.1  Hourglass  data  structures:  specifications  and  use 

We  have  seen  that  concatenation  of  hourglasses  gives  a  way  to  find  shortest  paths. 
However,  the  details  of  representing,  constructing,  and  concatenating  hourglasses 
have  gone  unmentioned  so  fax.  The  sequel  of  this  section  corrects  that  omission, 
introducing  a  data  structure  that  allows  efiicient  representation  and  meinipulation 
of  hourglasses.  This  subsection  specifies  the  operations  that  the  structure  must 
support,  as  well  as  the  edlowable  space  and  time  complexities  of  eaxrh.  It  demon¬ 
strates  that  any  structure  satisfying  the  specification  can  be  used  to  answer  shortest 
path  queries  efficiently.  The  following  two  subsections  present  a  data  structure  that 
meets  the  specification. 

Let  us  begin  by  giving  a  notation  for  our  data  structure.  We  represent  the  hour¬ 
glass  between  diagonals  di  and  dj  by  the  data  structure  H{di,d2).  In  all  the  speci¬ 
fications  that  follow,  whenever  diagonals  di,  d2,  and  da  appear,  we  assxime  that  da 
lies  between  di  and  da  in  P.  This  means  that  hourglasses  H{d\,d2)  and  if(d2,da) 
can  be  concatenated.^ 

Concatenation  is  the  fundamental  hourglass  operation.  The  previous  section 
shows  that  concatenation  reduces  to  finding  common  tangents  between  pairs  of 
convex  polygonal  chains.  A  point  is  a  degenerate  convex  chain,  but  our  algorithms 
must  be  able  to  handle  the  general  case  about  as  well  aa  the  degenerate  one.  This 
is  the  import  of  the  following  specification  requirement: 

(1)  If  our  data  structure  allows  tangent  lines  from  a  point  to  hourglasses  H{di ,  d2) 
and  H{d2,dz)  to  be  found  in  times  Tx  and  r2  respectively,  then  the  common 
tangents  of  the  two  hourglasses  must  be  computable  in  time  0(ri  r2). 

This  is  not  too  restrictive  a  condition:  the  standard  algorithms  for  finding 
a  line  through  a  point  and  teingent  to  a  convex  chain  are  related  in  the  seune 
way  to  the  standard  algorithms  for  finding  common  tangents  between  two  convex 
chains  [OvL81][PS85b,  Section  3.3. 6-7]. 

^The  data  structure  for  hourglass  H{d\,d2)  need  not  be  identical  to  that  for  H{d2,di),  though 
the  two  are  closely  related.  In  this  section  we  will  assume  that  one  hourglass  can  be  obtained  from 
the  other  in  constant  time.  This  happens  to  be  true  for  the  data  structure  introduced  in  the  next 
two  subsections;  if  it  were  not  true,  we  could  achieve  the  same  effect  (at  twice  the  cost)  just  by 
computing  both  versions  of  every  hourglass. 
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We  want  our  data  structure  to  reflect  the  close  tie  between  finding  tangents  and 
concatenation.  In  particular,  the  structvire  must  use  space  efficiently:  we  cannot 
afford  to'"  represent  each  hourglass  as  an  explicit  polygon.  If  an  hourglziss  is  formed 
by  the  concatenation  of  two  others,  we  must  record  only  incremental  information. 
Furthermore,  concatenation  must  leave  its  operands  imchanged.  Thus  we  impose 
the  following  requirement: 

(2)  If  the  common  tangents  between  and  H{d2,dz)  can  be  fotmd  in 

time  r,  then  the  hourglass  H{di,d3)  can  be  constructed  &om  the  two  con¬ 
stituent  hourglasses  in  0(r)  time  and  additional  space.  Furthermore,  the 
original  two  hourglasses  are  unaltered  by  the  concatenation  operation. 

The  following  two  properties  of  the  hourglass  specification  guarantee  that  it  can 
be  used  to  answer  queries  quickly. 

(3)  If  tangents  from  a  point  to  H(di,d2)  and  H(d2,d3)  can  be  found  in  times  Ti 
and  Tj,  tangents  to  H(di,d3)  can  be  found  in  time  max(ri,r2)  +  C  for  some 
constant  C. 

(4)  If  tangents  from  a  point  to  H(di,d2)  can  be  found  in  time  r,  the  length  of 
the  shortest  path  between  query  points  on  di  and  dj  can  be  foxmd  in  0(t) 
time.  The  path  itself  can  be  found  in  0(t  4-  m)  time,  where  m  is  the  number 
of  turns  in  the  path. 

The  shortest  path  from  pto  q  crosses  a  sequence  of  0(log  n)  principal  diagonals. 
Every  pair  of  neighbors  in  that  sequence  is  connected  by  an  edge  in  the  factor 
graph  S’*.  Thus  if  we  store  H{d\,d2)  for  every  pair  of  diagonals  dx  and  d2  linked  by 
an  edge  in  S*,  we  will  have  all  the  hourglasses  we  need  to  compute  shortest  paths 
by  concatenation.  The  next  few  paragraphs  present  an  algorithm  to  construct  the 
necessary  hourglasses  in  linear  time  and  space. 

In  Section  6.1  we  imagined  the  decomposition  of  P  as  proceeding  in  a  series 
of  time  steps,  adding  diagonals  of  depth  k  at  time  k.  Now  consider  running  time 
backwards,  first  deleting  diagonals  of  greatest  depth  in  S,  then  second  greatest. 
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and  so  on.  Cells  are  created  roughly  in  increasing  order  of  size;  when  a  new  cell 
appears,  we  will  construct  hourglasses  between  every  pair  of  diagonsds  on  the  cell’s 
boundary.  When  the  last  diagonal  is  finally  removed,  we  will  have  constructed 
hourglasses  corresponding  to  all  the  edges  in  5*. 

The  inductive  invariant  of  the  construction  is  the  following:  just  before  the 
diagonals  of  depth  k  are  deleted,  all  hourglasses  in  cells  with  depth  greater  than  k 
have  been  computed.  The  basis  is  easy  to  establish:  before  deleting  any  diagonals, 
we  construct  trivial  hourglasses  between  all  pairs  of  diagonals  that  lie  on  the  border 
of  the  same  triangle  in  the  tmderlying  triangulation.  As  the  algorithm  runs,  the 
deletion  of  a  diagonal  d  of  depth  k  merges  two  cells  into  one  (the  cell  P^).  Each  of  the 
two  cells  has  a  set  of  hourglasses  with  d  at  one  end  {d  has  been  paired  with  every 
other  bounding  diagonal  of  the  cell).  To  maintain  the  invariant,  we  concatenate 
every  hourglass  from  one  side  of  d  with  every  hourglass  from  the  other  side  of  d. 
This  process  computes  hourglasses  for  exactly  those  pairs  of  diagonals  linked  by 
edges  in  S*,  and  it  reqxiires  only  one  concatenation  for  each  hourglass  generated. 

The  hourglasses  built  during  the  construction  have  special  properties  that  we 
exploit  to  prove  the  linear  time  and  space  botmds.  The  procedure  combines  hour¬ 
glasses  in  a  balanced  fashion,  so  that  the  cost  of  computing  common  tangents,  which 
could  be  linear  in  the  sizes  of  the  chains  involved,  is  instead  logarithmic.  To  express 
this  bound,  let  us  introduce  a  logarithmic  quantity  to  measure  hourglass  size.  If 
two  diagonals  di  and  d^  are  linked  by  an  edge  in  5*,  they  lie  on  the  boxmdary  of 
some  cell.  We  define  \{d\,d2)  to  be  the  logarithm  of  the  size  of  that  cell. 

Lemma  6.2.  If  d\  and  dj  are  linked  by  an  edge  in  the  factor  graph,  it  is  possible 
to  find  the  tangents  through  a  point  to  the  hourglass  H(di,d2)  in  time  0(A(di,  d2)). 

Proof:  Let  C'  denote  the  constant  implied  by  the  “0”  notation  in  the 
statement  of  the  lemma.  Like  the  hourglass  construction,  the  proof 
proceeds  by  induction.  The  claim  is  certainly  true  for  the  hourglasses 
present  before  any  diagonals  are  deleted.  The  cells  they  bridge  are 
triangles,  and  so  tangent  computation  takes  only  constant  time,  say 
time  D. 
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Now  consider  deleting  a  diagonal  d,  creating  cell  Pd  from  left  and 
right  subcells.  We  concatenate  every  hourglass  that  links  d  to  a  diagonal 
on'its  left  with  every  hourglass  that  links  d  to  a  diagonal  on  its  right. 

The  left  and  right  cells  are  within  a  factor  of  two  in  size,  so  Pd  is  at 
least  3/  :  the  size  of  the  larger.  By  the  third  data  structure  requirement 
above,  tangents  to  a  newly  created  hourglass  H{d\^d%)  can  be  found 
in  time  at  most  C"A(di,d2)  —  C'log(3/2)  +  C.  Choosing  C  to  be  the 
maximum  of  C/log(3/2)  and  D  completes  the  proof.  | 

We  now  have  the  tools  we  need  to  prove  the  linearity  of  the  construction.  The 
preceding  proposition,  along  with  the  first  and  second  data  structure  requirements, 
implies  that  an  hourglass  obtained  by  concatenating  two  hourglasses  across  diago¬ 
nal  d'  can  be  built  from  its  constituent  hourglasses  in  time  and  space  proportional 
to  the  logarithm  of  the  size  of  Pd>.  This  quantity  is  proportional  to  ^(d'),  the  height 
of  d'  in  5,  which  is  in  turn  less  than  the  height  of  the  higher  of  the  two  diagonals  the 
new  hourglass  links.  Let  this  higher  diagonal  be  d.  Since  there  are  0(h(d))  edges 
in  S*  joining  d  to  lower  diagonals,  the  hourglasses  corresponding  to  these  edges  can 
be  built  in  0((h(d)y)  time  and  space.  Overall,  the  construction  takes  time  and 
space  proportional  to 

5;(Md))*  =  0(  Z  m3)‘n)=0(n). 

d€S*  ^  *<l+log3/2  ^ 

The  hourglasses  built  by  the  construction  are  sufiicient  to  answer  shortest  path 
queries  in  O(log^  n)  time:  There  are  0(log  n)  hourglasses  between  p  and  g  that  link 
principal  diagoneds.  These  can  be  concatenated  into  a  single  long  hourglass  using 
O(log  n)  concatenations.  The  above  proposition  and  the  first  three  data  structure 
requirements  imply  that  each  concatenation  takes  0(log  n)  time.  Finally,  by  viewing 
p  and  g  as  diagonals,  we  can  concatenate  H(p,dp),  H(dp,dg),  and  H(dg,g)  to  get 
H(p,  g)  in  O(log  n)  additional  time.  This  hourglass  is  a  string  equal  to  the  shortest 
path  from  p  to  g.  Since  H(p,g)  is  just  a  string,  the  length  of  the  shortest  path  is 
available  as  soon  as  the  concatenation  is  finished.  By  the  fourth  requirement,  the 
path  itself  can  be  extracted  in  additional  time  proportional  to  the  number  of  turns 
along  it. 
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In  Section  6.4  we  will  see  how  to  improve  this  bound  to  O(logn). 

6.3.2  The  chain  data  structure 

Convex  polygonal  chains  axe  the  basic  constituents  of  hourglasses.  Because  of  the 
importance  of  such  chains,  we  devote  this  subsection  to  discussion  of  the  chain 
data  structure  and  its  properties.  This  data  structure  represents  ordered  convex 
chains  by  binary  trees,  in  a  manner  similar  to  the  dynamic  convex  hull  structure  of 
Overmaxs  and  van  Leeuwen  [OvLSl].  Subsection  6.3.3  then  uses  this  data  structure 
to  implement  hourglasses.  In  order  to  ensure  that  the  hourglasses  built  from  chains 
satisfy  the  four  conditions  previously  stated,  we  construct  the  chains  to  satisfy 
analogous  conditions. 

The  convex  chains  that  appear  in  hourglasses  have  special  properties  not  fotmd 
in  general  convex  chains.  Each  chain  is  a  shortest  path  inside  P:  if  the  chain  has 
endpoints  A  and  B,  the  chain  is  tt(A,  B).  The  polygon  touches  only  one  side  of 
the  chain,  and  hence  the  chain  is  convex.  The  chain  bulges  toward  the  polygon 
interior.  Because  the  chain  belongs  to  an  open  hourglass  or  fimnel,  it  cannot  spiral: 
the  chain  is  equal  to  its  own  convex  hull  minus  the  segment  AB. 

By  convention  we  assume  that  the  chain  7r(A,  B)  uses  vertices  from  the  boundary 
of  P  counterclockwise  from  A  and  clockwise  from  B.  We  think  of  the  chain  as  convex 
upward,  with  A  at  the  left  end. 

In  our  representation,  a  chain  can  be  one  of  two  types:  a  trivial  chedn,  which  is 
a  single  polygon  edge,  or  a  derived  chain,  which  is  the  convex  hull  of  two  sub  chains. 
In  other  words,  if  AC  is  not  a  polygon  edge,  the  chain  7r(i4,  C)  is  represented  as  the 
convex  hull  of  ^{A,  B)  and  7r(B,  C)  for  some  B  along  the  polygon  boundary  between 
A  and  C.  The  chain  contains  the  outer  common  tangent  of  7r(A,  B)  and  ^{B,  C), 
along  with  the  parts  of  the  subchains  that  connect  the  tangent  to  A  and  C.  Either 
subchain  may  contribute  nothing  to  the  shortest  path,  or  the  tangent  may  be  null. 
Figure  6.6  gives  examples  of  these  cases. 

A  tangent  edge  may  consist  of  a  single  point;  however,  we  ensure  that  there  axe 
not  many  such  null  edges  in  any  chain.  If  the  combination  of  two  chains  would  put 
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Figure  6.6.  A  chain  is  the  convex  hull  of  its  subchains.  Either  subchain 
may  contribute  no  edges  to  the  result,  or  the  tangent  edge  may  be  null. 

two  null  edges  next  to  each  other,  we  consider  the  older  of  the  two  to  be  deleted  by 
the  addition  of  the  new  one.  This  means  that  every  chain  has  fewer  null  edges  than 
non-null  ones.  Similarly,  we  guarantee  that  no  chain  has  adjacent  collinear  edges  by 
extending  the  tangent  edge  to  delete  any  subchain  edge  with  which  it  is  collinear. 

The  two  most  important  operations  supported  by  the  chain  data  structure  are 
tangent-finding  and  construction  of  a  chain  from  its  subchains.  These  are  the 
same  operations  needed  in  the  dynamic  convex  hull  algorithm  of  Overmars  and 
van  Leeuwen  [OvLSl].  Like  the  dynamic  convex  hxill  structure,  the  chain  data 
structure  is  basiceJly  a  binary  tree.  A  trivial  chain  is  represented  by  a  leaf  node 
that  contains  the  edge.  A  derived  chain  7r( A,  C)  is  represented  by  a  node  that 
contains  the  common  tangent  of  the  subchains  ?r(A,  B)  and  7r(S,  C),  along  with 
pointers  to  the  nodes  that  represent  those  subchains. 

The  data  structure  must  support  two  types  of  tangent  queries.  In  the  first,  we 
are  given  a  chain  7r(A,  O)  and  a  query  point  X  outside  the  convex  hull  of  the  chain. 
We  want  to  find  the  two  tangents  from  X  to  the  chain.  We  can  find  each  tangent 
by  looking  at  the  edge  stored  at  the  node  v  that  represents  7r(A,  C*)-  In  constant 
time  we  determine  which  subchain  the  tangent  touches,  then  find  the  tangent  by 
recursively  searching  in  the  subchain’s  tree.  This  search  takes  time  proportional  to 
the  height  of  the  tree  rooted  at  v.  The  second  type  of  tangent  query  asks  for  the 
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inner  and  outer  common  tzmgents  of  two  chains.  Neither  chain  is  contained  in  the 
convex  hull  of  the  other.  Using  the  techniques  of  Overmars  and  van  Leeuwen,  the 
tangents  can  be  found  in  time  proportional  to  the  sum  of  the  heights  of  the  trees 
that  represent  'k{A,  B)  and  7r(C,  D). 

In  the  hourglass  application  of  the  chain  data  structure,  we  do  not  need  to  com¬ 
pute  all  the  tangents  for  each  operation.  We  modify  the  tangent-finding  algorithms 
to  produce  only  the  desired  tangent.  A  tangent  query  just  needs  to  specify,  for  each 
chain,  whether  the  tangent  segment  heads  clockwise  or  counterclockwise  from  the 
tangent  point. 

Our  data  structure  is  more  complicated  than  that  of  Overmars  and  van  Leeuwen 
because  it  must  support  more  operations;  however,  the  additional  structures  de¬ 
scribed  below  do  not  slow  down  the  tangent-finding  operations.  Thus  we  have 
established  the  following  analogues  to  conditions  (1)  and  (3): 

(Cl)  If  the  lines  passing  through  a  given  query  point  and  tangent  to  chains  7r(  A,  B) 
and  ir(C,  D)  can  be  fovmd  in  times  Ti  and  r2,  the  common  tangents  of  the  two 
chains  can  be  foimd  in  time  0(ri  -f  r2). 

(C3)  Suppose  chain  ff(A,  C)  is  built  from  subchains  ?r(A,  B)  and  ir(B,C).  If  tan¬ 
gents  from  a  point  to  chains  Tf{A,  B)  and  7r(B,  C)  can  be  foimd  in  times 
Ti  and  r2,  tangents  to  their  combined  chain  ir(A,  C)  can  be  foimd  in  time 
max(ri,  r2)  -1-  (constant). 

Condition  (4)  requires  us  to  compute  path  lengths  in  hourglasses.  To  help 
find  path  lengths,  we  store  chain  lengths  in  the  chain  data  structure.  Consider  a 
chain  ir(A,E)  built  from  two  subchains  7r(A,C')  and  Tr(C,E).  The  subchains  are 
joined  by  the  tangent  edge  BD,  as  shown  in  Figure  6.7.  At  the  node  represent¬ 
ing  7r(A,  E),  we  store  the  lengths  of  its  components  7r(A,  B),  BD,  and  7r(D,  E). 
The  sum  of  these  lengths  is  the  length  of  7r(A,  E). 

For  ease  of  description,  let  us  give  names  to  some  of  the  fields  in  the  chain 
data  structure.  If  node  v  represents  the  chain  of  Figure  6.7,  v.tan  is  the  tangent 
edge  WD.  The  lengths  of  7r(A,B)  and  Tr{D,E)  are  v.lleri  and  v.rlen.  The  tangent 
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Figure  6.7.  A  canonical  example  of  a  chain  formed  by  combining  two 
subchains. 

edge  has  length  v.tlen.  For  convenience  we  define  v.len  to  be  the  length  of  the  whole 
chain  7r(A,  E),  the  sum  of  the  other  length  fields.  If  u’s  chzdn  is  just  a  single  edge, 
then  v.llen  =  v.rlen  =  0  and  v.len  =  v.tlen. 

The  following  lemma  helps  us  fill  in  the  length  fields  of  v. 

Lemma  6.3.  Let  v  be  the  root  of  the  tree  representing  the  chain  7r(A,  E)  of  Fig¬ 
ure  6.7.  Let  u  be  a  node  in  the  tree  whose  edge  u.ian  appears  in  w{A,  E),  and  let  X 
be  the  right  endpoint  of  u.ian.  The  length  of  the  chain  Tr(A,X)  can  be  determined 
by  looking  at  the  lengths  stored  at  the  nodes  along  the  path  from  v  to  u  in  the  tree. 

Proof:  The  proof  is  by  induction  on  the  length  of  the  path  from  v  to  u. 

If  the  path  is  of  length  zero,  then  u  =  v,  and  u.ian  =  BD.  The  length 
of  7r(A,X)  is  just  u.lhn  +  u.ilen,  the  length  of  if{A,B)  plus  the  length 

ofro. 

If  u  ^  u,  the  proof  has  two  cases,  depending  on  whether  u  is  in  the 
left  or  right  subtree  of  v.  If  u  is  left  of  u,  then  only  edges  from  the 
subchain  7r(A,B)  contribute  to  t:{A,X).  The  right  subchain  i^iC^E) 
has  no  effect  on  the  length  of  Tr{A,X).  By  induction,  we  can  find  the 
length  of  v{A,X)  by  looking  at  the  lengths  stored  along  the  path  from 
the  left  child  of  v  down  to  u. 

If  u  is  right  of  v,  then  7r(A.,  X)  consists  of  7r(A,  S),  BD,  and  some 
prefix  of  7r(D,  E).  Thus  the  length  of  7r(A,  X)  is  v.llen  +  v.ilen  plus  the 
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length  of  Tr(D,X).  This  last  term  is  the  same  as  the  length  of  'k{C,X) 
minus  the  length  of  7r(C,  D).  If  <  is  the  right  child  of  v,  the  node  as¬ 
sociated  with  t:{C,E),  then  the  length  of  ■k{A,X)  is  v.llen  v.tlen  -f 
(v.rlen  —  t.len)  plus  the  length  of  ir(C,X).  (This  is  just  v.len  —  t.len  plus 
the  length  of  7r(C',  X).)  By  induction,  we  can  find  the  length  of  ir(C, X) 
by  looking  at  the  nodes  on  the  path  from  t  to  u.  | 

When  we  build  chain  yr(A,  E)  out  of  subchains  ic(A,  C)  and  ir(C,  E),  we  find  the 
outer  common  tangent  of  the  two  subchains,  BD  in  Figure  6.7.  Let  v  be  the  node 
that  represents  ff(.A,  E),  and  let  c  be  the  edge  of  ir(i4,  C)  that  is  just  to  the  left  of 
the  tangent  BD.  Edge  c  is  u.tan  for  some  node  it  in  the  tree  representing  the  left 
subchain.  We  find  it  during  the  tangent-finding  operation.  The  preceding  lemma 
shows  that  we  can  determine  v.llen  by  looking  at  the  nodes  along  the  path  from  it 
up  to  the  root  of  the  left  subtree.  We  can  find  v.rlen  in  a  similar  manner.  Thus 
all  the  length  fields  can  be  computed  in  time  proportional  to  the  height  of  v.  This 
completes  our  discussion  of  the  length  fields. 

The  hardest  data  structure  requirement  to  satisfy  is  the  edge  retrieval  one:  given 
a  cheiin  data  structure,  we  must  be  able  to  retrieve  the  edges  of  the  chain  in  constant 
time  apiece.  Retrieval  in  constant  time  per  edge  is  especially  difficult  because  the 
data  structure  for  a  chain  must  not  modify  the  corresponding  structures  for  its  left 
and  right  subchains. 

Our  solution  uses  a  compact  representation  of  the  differences  between  the  chain 
and  its  left  and  right  subchains.  The  representation  is  based  on  storing  the  two 
paths  from  the  root  to  the  nodes  u  and  y  whose  edges  u.tan  and  y.tan  are  adjacent 
on  either  side  to  the  tangent  between  the  subchains. 

To  be  more  concrete,  consider  the  node  v  that  represents  the  chain  in  Figure  6.7. 
Node  V  has  two  lists  associated  with  it,  a  left  and  a  right  edge  retrieval  list,  or 
retrieval  list  for  short.  Each  record  r  in  the  left  retrieval  list  has  two  pointers,  one 
to  a  node  in  u’s  left  subtree,  and  one  to  a  record  in  another  retrieval  list,  called  the 
•partner  of  r.  The  right  retrieval  list  is  analogous  to  the  left  retrieval  list. 

We  define  retrieval  lists  in  two  stages:  we  specify  the  node  pointers  here  and 
the  partner  fields  below.  Let  w  be  the  left  child  of  v,  and  let  u  be  the  node  in  the 
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subtree  rooted  at  w  whose  edge  u.ian  appears  in  7r(^,  E)  and  is  incident  to  B.  The 
records  in  u’s  left  retrieval  list  point  to  the  nodes  on  the  path  from  w  to  u,  inclusive, 
whose  associated  edges  belong  to  7r(A,  E).  The  retrieval  list  keeps  the  nodes  in  the 
same  order  as  the  path  from  w  to  u.  We  have  specified  the  node  pointers  of  the  left 
retrieval  list;  those  of  the  right  retrieval  list  axe  defined  analogously. 

The  following  lemma  partially  characterizes  edge  retrieval  lists,  and  thereby 
enables  us  to  specify  the  partner  fields  of  list  records. 

Lemma  6.4.  Let  s  and  t  be  two  nodes  pointed  to  by  consecutive  records  in  the  left 
retrieval  list  of  v.  Then  these  two  nodes  are  in  an  ancestor/descendant  relationship. 
If  s  is  the  ancestor,  then  t  is  in  the  right  subtree  of  s,  and  some  record  in  the  right 
retrieval  list  of  s  points  to  i.  Analogous  claims  hold  for  records  in  the  right  retrieval 
list  of  V. 

Proof:  We  first  prove  that  t  is  in  the  right  subtree  of  s.  Because  u.tan 
is  adjacent  to  v.tan,  it  must  lie  to  the  right  of  s.tan,  and  hence  u  is  in 
the  right  subtree  of  s.  Since  the  path  from  ti  to  v  goes  through  t,  t  must 
be  in  the  right  subtree  of  s. 

Because  fs  associated  edge  t.ian  appears  in  v’s  chain  ‘ir(A,  E),  it  also 
appears  in  the  chain  r:  represented  by  s.  All  the  edges  between  s.tan 
and  t.tan  in  ir{A,  E)  also  belong  to  x. 

Let  X  be  the  descendant  of  s  such  that  x.tan  is  immediately  to  the 
right  of  s.tan  in  x.  Either  x  =  t  or  a:  is  to  the  left  of  the  path  from  s 
to  t.  Let  y  be  the  lowest  common  ancestor  of  x  and  t.  Since  x  and  t  are 
both  to  the  right  of  s,  y  is  not  equal  to  s. 

In  any  chain  in  which  y.tan  appears,  it  must  lie  between  x.tan  and 
t.tan,  inclusive;  otherwise  x  and  t  would  lie  in  the  same  subtree  of  y. 
Edges  x.tan  and  t.tan  both  belong  to  x(A,  E),  and  hence  so  does  y.tan. 

Node  y  lies  on  the  path  from  u  to  u’s  left  child  w,  so  it  must  belong  to 
u’s  left  retrieval  list.  Because  s  ^  y,  it  must  be  that  y  =  t.  Hence  t  is 
an  ancestor  of  x.  The  edge  t.tan  belongs  to  x,  and  so  some  record  in 
the  right  retrieval  list  of  s  must  point  to  t.  I 
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We  can  now  specify  the  partner  fields  of  retrieval  list  records.  The  partner  field 
of  the  first  record  in  a  retrieval  list  is  mill.  Let  r  and  r'  be  two  consecutive  records 
in  v's  left  retrieval  list  that  point  to  nodes  s  and  t,  with  s  an  ancestor  of  t.  Then 
the  partner  field  of  r'  points  to  the  record  in  the  right  retrie\’al  list  of  s  that  points 
to  t.  The  partner  fields  of  records  in  the  right  retrieval  list  are  defined  analogously. 

The  following  two  lemmais  show  that  edge  retrieval  lists  allow  us  to  report  the 
edges  of  a  chain  in  constant  time  apiece. 

Lemma  6.5.  Let  r  be  a  record  in  the  left  retrieval  list  of  v,  and  let  s  be  the  node  it 
points  to.  Using  the  part  of  the  retrieval  list  r  beads,  we  can  retrieve  all  the  edges 
of  u’s  chain  strictly  between  s.tan  and  v.tan  in  left-to-right  order  and  in  constant 
time  per  edge.  A  similar  claim  holds  for  right  retrieval  lists. 

Proof:  The  proof  is  by  double  induction  on  the  height  of  v  and  the 
length  of  the  retrieval  list  tail  r  heads. 

If  V  has  height  zero,  that  is,  it  has  no  child  trees,  its  retrieval  lists 
are  null  and  the  claim  is  vacuously  true.  The  claim  also  holds  any  time 
the  retrieval  list  tail  is  empty:  there  axe  no  edges  to  retrieve. 

If  r  is  the  last  record  in  u’s  left  retrieval  list,  the  claim  is  true.  There 
axe  no  edges  in  v's  chain  between  s.tan  and  t.tan. 

Now  assume  the  claim  holds  for  all  nodes  with  height  less  than  that 
of  V.  Further  assume  that  it  holds  for  nodes  with  v’s  height  and  retrieval 
list  tails  shorter  than  the  one  r  heads.  Let  r'  be  the  successor  of  r  in 
v’s  left  retrieval  list,  and  let  t  be  the  node  pointed  to  by  r'.  We  need  to 
retrieve  all  the  edges  of  v’s  chain  between  s.tan  and  t.tan. 

Since  both  s.tan  and  t.tan  appear  in  v’s  chain,  the  edges  between 
them  in  v’s  chain  are  also  edges  of  the  chain  represented  by  s.  As 
noted  in  Lemma  6.4,  some  record  in  the  right  retrieval  list  of  s  points 
to  t.  The  partner  field  of  r’  points  to  that  record.  By  the  induction 
hypothesis,  we  can  retrieve  all  the  edges  between  s.tan  and  t.tan  using 
the  tail  of  the  right  retrieval  list  of  s  that  r'  points  to,  since  the  height 
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of  s  is  less  than  that  of  v.  In  constant  time  we  report  t.ian.  Again  by 
the  induction  hypothesis,  we  can  report  all  the  edges  between  i.tan  and 
v.tdn  in  constant  time  apiece,  since  the  list  r'  heads  is  shorter  than  the 
one  r  heads.  | 

Lemma  6.6.  Usiag  v’s  left  edge  retrieval  list,  we  can  report  all  the  edges  of  v’s 
chain  that  are  to  the  left  of  v.tan.  We  produce  the  edges  in  left-to-right  order  and 
at  constant  cost  per  edge.  A  similar  claim  holds  for  v ’s  right  retrieval  list. 

Proofs  We  consider  only  the  edges  to  the  left  of  v;  the  other  case  is 
analogous.  The  proof  is  by  induction  on  the  height  of  v.  If  u  has  no 
children,  its  retrieval  lists  are  null  and  the  claim  is  trivially  true.  The 
claim  is  similarly  true  whenever  the  left  retrieval  list  is  null. 

If  the  left  retrieval  list  is  not  null,  let  r  be  the  head  of  the  list,  and 
let  t  be  the  node  to  which  r  points.  We  prove  that  t  is  in  the  left  subtree 
of  each  of  its  ancestors  on  the  path  to  u.  If  t  were  to  the  right  of  any 
of  its  ancestors,  the  edge  belonging  to  the  highest  such  ancestor  would 
belong  to  v’s  chain,  since  all  the  edges  belonging  to  higher  nodes  would 
be  to  the  right  of  t.ian.  Since  none  of  t’s  ancestors  has  an  edge  that 
belongs  to  u’s  chain,  t  is  to  the  left  of  all  its  ancestors. 

The  left  part  of  t’s  chain  is  not  hidden  by  any  of  t’s  ancestors,  and 
so  it  belongs  to  v’s  chain.  By  induction,  we  can  report  the  edges  left  of 
t.tan  using  t’s  left  retrieval  list.  We  report  t.tan  in  constant  time.  By 
Lemma  6.5,  we  can  report  the  edges  between  t.tan  and  v.tan  in  order 
and  in  constant  time  apiece  using  u’s  left  retrieval  list.  I 

We  have  shown  that  edge  retrieval  lists  can  be  used  to  report  the  edges  of  a  chain 
in  constant  time  apiece.  The  lists  for  a  node  v  are  small:  their  size  is  bounded  by 
a  constant  times  the  height  of  v.  In  the  remainder  of  this  subsection,  we  show  how 

to  construct  edge  retrieval  lists  efficiently. 

Let  u  be  the  node  in  u’s  left  subtree  such  that  u.tan  is  adjacent  to  v.tan  in 
v’s  chain.  The  left  retrieval  list  of  v  points  to  nodes  on  the  path  from  v  to  «.  To 
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construct  the  retrieval  list,  we  must  be  able  to  tell  which  nodes  on  the  path  have 
edges  in  u’s  chain.  We  give  a  method  for  recognizing  these  nodes  below. 

Once  we  know  which  nodes  on  the  path  to  u  have  edges  in  u’s  chain,  we  can 
build  a  retrieval  list  with  pointers  to  these  nodes.  We  then  fill  in  the  partner  fields 
of  the  records  in  the  list.  The  partner  field  of  the  first  record  in  the  retrieval  list 
is  null.  For  each  pair  of  nodes  s  and  t  pointed  to  by  consecutive  records  r  and  r' 
in  u’s  left  retrieval  list,  we  fill  in  the  partner  field  of  r'  by  walking  down  the  right 
retrieval  list  of  3  to  the  record  that  points  to  t.  That  record  is  the  partner  of  /. 
Setting  all  the  partner  fields  takes  time  proportion2J  to  the  height  of  v.  When  we 
walk  down  the  right  retrieval  list  of  s  looking  for  a  record  that  points  to  t,  we  see 
records  that  point  to  nodes  with  distinct  heights  less  than  that  of  a  and  no  smaller 
than  that  of  t.  Overall,  the  number  of  records  we  examine  while  setting  partner 
fields  is  less  than  the  length  of  the  path  from  v  to  u. 

We  now  tell  how  to  recognize  the  nodes  on  the  path  from  v  to  u  that  have  edges 
in  v’s  chain.  Our  approach  is  based  on  comparing  pairs  of  diagonals  or  polygon 
edges  using  the  hiding  test  described  below.  Let  t  be  a  node  in  the  tree  rooted  at  v. 
If  t.tan  does  not  appear  in  u’s  chain,  there  is  some  edge  ab  in  u’s  chain  such  that 
t.tan  is  contained  in  the  pocket  defined  by  ab  and  the  polygon  boimdary  between 
a  and  b.  We  say  that  ab  bides  t.tan.  See  Figure  6.8. 


Figure  6.8.  Segment  ab  hides  cd 

We  can  test  whether  one  edge  hides  another  by  comparing  the  indices  of  their 
endpoints,  a-«;siiming  all  the  endpoints  are  polygon  vertices.  Suppose  that  the  edge 
^nnecting  vertices  i  and  /  belongs  to  u’s  chain,  and  i  is  left  of  I  on  the  chain.  If 
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i  <  /,  the  polygon  vertices  are  numbered  consecutively  along  the  boundary  of  the 
pocket  cut  off  by  17.  In  this  case,  an  edge  jk  is  hidden  by  il  if  and  only  ii  jk  ^  il 
and  both'integers  j  and  k  lie  in  the  interval  [t,  /].  If  i  >  /,  an  edge  jk  is  hidden  by  il 
if  and  only  if  jk  ^  il  and  both  j  and  k  are  outside  the  interval  [/  +  1,  *  —  1]. 

At  query  time,  not  all  chain  vertices  are  polygon  vertices:  the  query  points  p 
and  q  appear  in  some  chains.  Points  p  and  q  are  not  necessarily  polygon  vertices, 
so  we  must  assign  them  vertex  numbers  before  using  the  hiding  test.  Suppose  that 
source  point  p  lies  in  triangle  Aijk  and  that  dp  is  diagonal  ik.  (See  Figure  6.9.) 
We  construct  the  hourglass  from  p  to  9  by  pretending  p  is  a  polygon  vertex  (it  is 
as  if  the  darkly  shaded  part  of  the  polygon  were  absent).  We  assign  to  p  the  vertex 
number  J.  This  gives  a  consistent  counterclockwise  numbering  to  the  vertices  of  the 
unshaded  subpolygon  and  allows  us  to  test  hiding  as  described  above. 


k 


Figure  6.9.  We  assign  vertex  numbers  to  p  and  q,  and  hence  are  able  to 
use  the  hiding  test  at  query  time. 

The  following  lemma  shows  how  to  use  the  hiding  test  to  construct  retrieval 
lists. 

Lemma  6.7.  Let  u  be  the  node  in  the  left  subtree  of  v  such  that  u.tan  is  adjacent 
to  v.tan  in  v’s  chain,  and  let  t  be  a  descendant  of  v  on  the  path  from  v  to  u.  Let  s 
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be  the  lowest  proper  ancestor  of  t  such  that  s.tan  appears  in  v ’s  chain.  Either  t.ian 
is  hidden  by  s.tan  or  by  v.tan,  or  it  appears  in  v’s  chain. 

Proof:  If  t.tan  does  not  appear  in  u’s  chain,  it  is  hidden  by  an  edge 
belonging  to  one  of  its  ancestors.  If  s  =  w,  then  v.tan  is  the  only 
edge  that  can  hide  t.tan.  If  s  ^  v,  Lemma  6.4  implies  that  t  is  in  the 
right  subtree  of  s.  By  the  same  lemma,  s  is  in  the  right  subtree  of  all 
its  aincestors  that  appear  in  u’s  left  retrieval  list.  The  edges  of  those 
ancestors  cannot  hide  t.tan:  they  are  left  of  s.tan  and  t.tan  is  not.  The 
only  remaining  ancestors  of  t  whose  edges  may  hide  t.tan  are  s  and  v. 

I 

To  determine  which  nodes  on  the  path  from  u  to  u  have  edges  in  v’s  chain,  we 
walk  down  the  path  testing  each  one.  Suppose  t  is  the  current  node  and  s  is  its 
lowest  proper  ancestor  such  that  s.tan  appears  in  v’s  chain.  If  t.tan  is  not  hidden 
by  s.tan  or  v.tan,  then  it  appears  in  v’s  chain. 

We  have  now  described  all  the  elements  of  the  chain  data  structure.  A  chain 
is  represented  by  the  root  node  of  a  binary  tree;  the  node  is  augmented  with  edge 
retrieval  lists  to  help  report  the  edges  of  the  chain.  Including  its  retrieval  lists, 
each  node  uses  storage  proportional  to  the  height  of  its  tree.  We  have  shown  how 
to  construct  a  chain’s  tree  from  its  subchains’  trees  in  time  proportional  to  the 
height  of  the  new  tree.  The  new  chain’s  data  structure  does  not  modify  any  of 
the  old  chains’  data  structures.  Taken  together,  these  facts  establish  the  following 
analogues  to  requirements  (2)  and  (4): 

(C2)  If  the  outer  common  tangent  of  chains  Tr(A,B)  and  ir(B,C)  can  be  fotmd 
in  time  r,  the  data  structure  for  their  combined  chsiin  7r(A,  C)  can  be  built 
in  0(r)  time  and  additional  space.  Furthermore,  the  data  structures  for  the 
original  two  chains  are  imaltered  by  the  construction. 

(C4)  The  length  of  a  chain  can  be  found  in  constant  time.  The  edge  sequence  of 
the  <’ba.in  can  be  extracted  in  constant  time  per  edge. 
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6.3.3  Hourglass  data  structures:  implementation 

This  subsection  uses  the  chains  of  Section  6.3.2  to  implement  the  hourglass  data 
structure.  The  hourglass  data  structure  satisfies  the  requirements  emmciated  at 
the  beginning  of  this  section,  and  hence  it  can  be  used  to  answer  shortest  path 
queries  efficiently.  Hourglasses  have  two  main  components:  chjiins  and  strings.  We 
have  described  the  chain  data  structure  in  the  preceding  subsection;  we  begin  this 
subsection  with  a  discussion  of  strings. 

If  we  were  concerned  only  with  the  lengths  of  shortest  paths  and  not  with  the 
paths  themselves,  we  could  represent  strings  by  their  lengths  alone.  However,  we 
need  additional  structures  to  represent  the  edges  of  a  string.  Our  representation 
uses  two  types  of  strings,  fundamental  and  derived.  Fundamental  strings  are  formed 
when  the  concatenation  of  two  hourglasses  collapses  the  multiple  paths  represented 
by  funnels  or  open  hourglasses  into  a  single  path.  A  fundamental  string  consists 
of  two  convex  chains  linked  by  a  tangent  edge;  any  of  the  pieces  may  be  null.  See 
Figme  6.10  for  examples.  Derived  strings  are  obtained  by  concatenating  two  or 
more  previously  defined  strings  (either  fundamental  or  derived).  A  derived  string 
is  produced  when  a  closed  hourglass  is  concatenated  with  another  hourglass. 

The  chains  that  appear  in  strings  are  obtained  by  truncating  hourglass  chains. 
We  can  represent  tnmcated  chains  using  the  data  structure  of  Section  6.3.2.  To  clip 
off  the  right  end  of  a  chain,  we  simply  create  a  new  chain  in  which  the  old  chain  is 
the  left  subchain,  the  right  subchain  is  null,  2ind  the  tangent  edge  is  the  new  chain’s 
right  endpoint. 

A  derived  string,  when  fully  expanded,  is  a  sequence  of  fundamental  strings. 
The  data  structure  for  a  derived  string  represents  this  sequence  as  the  leaves  of 
a  tree.  A  derived  string  is  represented  by  an  internal  node  whose  children  are  its 
constituent  strings.  The  tree  has  two  or  three  children  per  node.  See  Figure  6.11. 

We  construct  the  derived  string  trees  so  that  the  non-empty  fundamental  strings 
that  make  up  a  derived  string  can  be  extracted  in  constant  time  apiece.  When 
concatenation  would  cause  us  to  include  an  empty  string  in  a  derived  string,  we 
drop  that  string.  We  also  rule  out  nodes  in  the  tree  representation  of  derived 
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Figure  6.10.  Examples  of  fundamental  strings 

strings  that  have  only  one  child.  That  is,  when  two  hotnrglasses  are  concatenated 
Eind  the  resulting  string  is  just  the  string  belonging  to  one  of  the  two,  we  set  the 
new  hourglass  to  point  to  the  old  string,  rather  than  creating  a  new  derived  string 
node  with  only  one  child.  This  strategy  means  that  the  internal  nodes  of  a  derived 
string  tree  have  at  least  two  children.  Traversing  the  tree  takes  time  proportional 
to  the  number  of  leaves  (non-empty  fund2Lmental  strings).  Since  the  edges  of  each 
fundamental  string  can  be  extracted  in  constant  time  per  edge,  the  edges  of  any 
string  can  be  extracted  within  the  same  time  bound.  This  concludes  our  discussion 
of  the  string  data  structure. 

An  hourglass  is  built  of  at  most  four  convex  chains  and  one  string.  The  hourglass 
data  structure  has  pointers  to  these  five  components.  If  any  of  the  parts  is  missing, 
as  when  the  hourglass  is  open,  the  corresponding  pointer  is  null.  To  concatenate 
two  hourglasses,  we  find  the  relevant  common  tangents  of  their  chmns,  build  the 
appropriate  chains  and  strings,  and  set  the  pointer  fields  of  a  new  hourglass  struc¬ 
ture.  Each  new  chain  is  either  an  old  chain  or  a  convex  hull  of  two  old  chains. 
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Figure  6.11.  A  derived  string  and  its  data  structure 
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possibly  with  an  end  clipped  off  as  described  above.  The  new  string,  if  any,  is  either 
a  fundamental  string  or  a  derived  string  that  points  to  one  new  fundamental  string 
and  one  or  two  old  strings. 

We  can  conclude  from  the  preceding  discussion  that  the  hourglass  data  structure 
meets  its  specification.  Each  hourglass  and  string  record  uses  a  constant  number  of 
chains,  and  each  concatenation  requires  a  constant  number  of  chain  operations.  Be¬ 
cause  chains  satisfy  conditions  (Cl) — (C4),  the  hourglasses  built  from  them  satisfy 
requirements  (1) — (4). 


6.4  Improved  query  time  bounds 


Bypasses  are  devices  that  allow  some  people  to  dash  from  point  A  to 
point  B  very  fast  while  other  people  dash  from  point  B  to  point  A 
very  fast.  People  living  at  point  C,  being  a  point  directly  in 
between,  are  often  given  to  wonder  what’s  so  great  about  point  A 
that  so  many  people  from  point  B  are  so  keen  to  get  there,  and 
what’s  so  great  about  point  B  that  so  many  people  from  point  A 
are  so  keen  to  get  there.  They  often  wish  that  people  would  just 
once  and  for  all  work  out  where  the  hell  they  wanted  to  be. 

—  Douglas  Adams,  The  Hitchhiker’s  Guide  to  the  Galaxy  (1979) 


The  data  structures  described  above  give  an  O(log^  n)  time  boimd  for  shortest 
path  queries.  This  section  shows  how  to  cut  a  logarithmic  factor  off  that  bound. 
The  idea  behind  the  improvement  comes  from  the  following  observation:  if  many 
shortest  path  queries  are  answered,  hourglasses  corresponding  to  edges  high  up  in 
the  factor  graph  are  used  many  times;  furthermore,  the  cost  of  concatenating  them 
is  the  major  contribution  to  query  time.  If  we  could  find  a  way  to  bypass  most  of 
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the  computation  high  up  in  S*,  we  could  reduce  query  costs. 

Finding  the  shortest  path  from  p  to  q  has  four  main  steps:  (1)  locating  the 
points  iii  the  triangulation,  (2)  finding  the  subsequences  Dp  and  Dq  of  the  separat¬ 
ing  diagonals,  (3)  finding  the  hourglass  between  dp  and  d„  and  (4)  concatenating 
H{p,  dp)  and  H{dq,q)  with  H{dp,  dq).  Of  these  steps,  only  the  third  takes  more  than 
logarithmic  time.  To  speed  up  step  (3),  we  precompute  some  of  the  intermediate 
results  it  uses. 

The  decomposition  tree  S  is  balanced,  and  hence  there  axe  only  0(nf  log^  n) 
nodes  in  S  with  at  least  a  log^  n  descendants.  (The  constant  a  is  a  parameter  that 
can  be  adjusted  to  trade  increased  preprocessing  for  decreased  query  time.)  Let 
the  set  of  these  upper  nodes  be  U.  Modify  the  graph  S*  by  adding  to  it  all  edges 
from  nodes  of  U  to  their  ancestors.  Eadi  node  of  U  has  0(log  n)  ancestors,  so  the 
modification  adds  0(n/ log  n)  edges  to  S*.  The  hourglasses  corresponding  to  the 
additional  edges  can  be  computed  in  O(log  n)  time  apiece,  since  the  data  structures 
that  represent  them  have  logarithmic  height.  (Each  node  d  of  U  must  be  linked  to 
all  its  ancestors  in  S.  Linking  to  the  ancestors  in  order  along  the  path  from  d  to 
the  root  of  5  gives  the  claimed  bounds.)  Overall,  the  additional  hourglasses  take 
linear  time  and  space  to  construct. 

These  additional  hourglasses  provide  the  necessary  bypass  structxures  to  cut  a 
logarithmic  factor  off  the  shortest-path  query  time.  Let  p  and  q  be  the  query  points. 
In  logarithmic  time  we  find  the  subsequences  Dp  and  Dq  of  the  separating  diagonals. 
The  diagonal  d  is  the  least  common  ancestor  in  5  of  dp  and  dq.  If  d  is  not  in  U, 
then  the  cells  between  dp  and  d,  have  0(log^  n)  edges  altogether,  since  they  are 
contained  in  P^.  There  are  only  0(log(log*  n))  =  O(log  log  n)  hourglasses  that  need 
to  be  concatenated  in  order  to  produce  H{dp,dq),  and  each  concatenation  takes 
O(loglogn)  time.  Thus  step  (3)  takes  0((loglogn)2)  which  is  dominated  by 
the  time  of  the  other  steps. 

If  d  is  in  U,  then  the  situation  is  slightly  more  complex.  Let  dp  be  the  highest 
diagonal  in  Dp  that  is  not  in  U,  and  let  d+  be  the  next  higher  diagonal  in  Dp 
(the  successor  of  d~  in  Dp).  Define  d“  and  d^  similarly.  (S^  Figtire  6.12.)  As 
in  the  case  when  d  ^  U,  we  can  find  H{dp,dp)  and  H{d^  j^?)  O((loglogn)  ) 
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time.  The  hourglasses  H(d~,d^)  and  H(d^,d-)  correspond  to  edges  of  5*,  and 
hence  have  been  precomputed.  Since  d  is  an  ancestor  in  5  of  d+  and  <i+,  eind  all 
three  diagonals  are  in  U,  the  hourglasses  H{d+,d)  and  H(d,d^)  also  already  exist. 
Concatenating  these  six  hourglasses  to  get  H(dp,  d,)  takes  logarithmic  time,  and 
therefore  the  length  of  the  shortest  path  from  p  to  g  can  be  found  in  0(log  n)  time. 


Figure  6.12.  Definitions  for  query  speedup 


Theorem  6.1 .  There  is  a  linear  space  data  structure  for  storing  a  simple  polygon  P 
of  size  n  so  that  given  any  two  points  inside  P,  the  length  of  the  shortest  path 
joining  them  can  be  computed  in  O(logn)  time.  The  actual  path  can  be  extracted 
in  additional  time  proportional  to  the  number  of  turns  along  it.  Furthermore, 
this  shortest-path  query  structure  for  P  can  be  computed  in  linear  time  once  a 
triangulation  of  P  is  available. 
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6«5  Applications  and  extensions 

Our  method  can  yield  improved  bounds  for  various  questions  related  to  relative 
convex  hulls  [BT86].  For  example,  given  a  simple  polygon  F  of  n  sides,  we  can 
preprocess  its  exterior  in  0(n  log  log  n)  time  and  build  the  factor  graph  for  each 
bay  cut  off  by  the  convex  hull  of  F.  Now,  given  another  simple  polygon  Q  of  m 
sides  and  disjoint  from  F,  we  can  compute  an  implicit  representation  of  the  relative 
convex  hull  of  Q  with  respect  to  F  in  time  O(mlogn).  This  implicit  relative  convex 
hull  can  be  used  to  answer  the  moving  separability  question  of  Bhattacharya  and 
Toussaint  [BT86].  An  explicit  representation  of  the  relative  convex  hull  can  be 
extracted  in  additional  time  proportional  to  the  size  of  the  hull. 

A  related  question  has  to  do  with  computing  the  relative  convex  hull  of  m  points 
in  a  simple  polygon  F  of  size  n.  After  preprocessing  the  polygon  for  shortest  path 
queries,  we  can  solve  this  problem  in  additional  time  0(m  log  m  +  m  log  n).  The 
important  observation  is  that  we  can  define  a  “relative”  notion  of  the  cotmterclock- 
wise  test  for  three  points  a,  6,  and  c  in  F  that  is  testable  in  O(logn)  time  using 
our  structures.  The  test  looks  at  the  (implicit)  description  of  the  paths  7r(a,  b) 
and  7r(a,  c)  and  finds  the  place  where  these  paths  diverge;  the  relative  directions  of 
these  paths  at  the  jtmcture  tell  the  sign  of  the  test  for  (a,  6,  c).  One  can  prove  that 
the  outcome  of  this  test  is  the  same  no  matter  which  of  the  three  points  is  used 
as  the  path  origin.  Note  also  that  this  test  and  the  ordinary  counterclockwise  test 
(with  no  reference  to  F)  can  have  opposite  results  on  the  same  three  points.  With 
this  tool  at  our  disposal,  we  can  proceed  as  follows  to  determine  the  relative  convex 
hull:  Locate  all  the  m  points  in  the  triangles  of  the  triangulation  underlying  the 
preprocessing.  For  each  group  of  points  in  the  same  triangle  compute  its  standard 
convex  hull  and  discard  all  points  not  on  the  hiiU.  Suppose  a  total  of  m'  points 
remain.  The  cost  of  what  we  have  done  so  far  is  the  cost  of  the  preprocessing  for  F, 
plus  O(mlogm)  for  all  the  convex  hull  computations.  We  can  now  connect  these 
hulls  by  (implicit)  non-crossing  paths  lying  in  F.  See  Figure  6.13.  By  doubling  up 
these  paths  we  can  connect  all  m'  points  into  one  simple  polygon  Q  lying  inside  F. 
The  edges  of  this  polygon  are  shortest  paths  in  F;  also,  some  vertices  may  occur 
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up  to  three  times  along  the  boimdaxy  of  Q.  However,  we  can  still  use  a  “linear” 
convex  hull  algorithm  on  such  a  polygon,  for  the  only  tool  such  an  algorithm  needs 
is  the  counterclockwise  test  on  three  points  discussed  above.  Of  course  each  test 
now  costs  0(log  n),  for  a  total  of  0(m  log  n)  for  this  peirt  of  the  computation. 


Figure  6.13.  Computing  the  relative  convex  hull  of  some  points  in  P 

Another  interesting  application  has  to  do  with  computing  the  furthest  neighbor 
for  all  vertices  of  a  simple  polygon.  This  is  analogous  to  the  same  question  for  a 
convex  polygon  considered  in  [AKM*86],  only  now  we  meastire  distance  between 
vertices  by  the  shortest  path  inside  the  polygon.  By  combining  the  approaches 
of  (AKM*86]  and  the  present  chapter,  we  can  perform  this  computation  for  a  poly¬ 
gon  of  size  n  in  total  time  0{n  log  n).  A  recent  paper  of  Suri  [Sur87]  achieves  the 
same  bound  using  a  different  approach. 

As  a  final  observation,  we  note  that  the  preprocessing  described  in  Section  6.3.1 
can  be  adapted  to  solve  the  shooting  problem  for  P,  as  defined  by  Chazelle  emd 
Guibas  [CG85,  Section  3].  This  problem  calls  for  preprocessing  P  so  that,  given 
any  point  x  inside  P  and  any  direction  «,  we  can  quickly  compute  the  point  hH(x,  u) 
where  the  ray  emanating  from  x  in  direction  u  hits  the  boundary  of  P  for  the  first 
time.  If  the  polygon  P  hM  n  sides,  the  method  presented  in  [CG85]  solves  the 
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shooting  problem  in  0(log  n)  time  per  query,  after  building  a  structure  in  0{n  log  n) 
time  that  requires  0(n)  space.  We  adapt  our  algorithm  to  build  the  shooting 
structiure  used  by  [CG85]  in  linear  time. 

The  shooting  structure  of  Chaaelle  and  Guibas  uses  hourglasses  similar  to  those 
of  this  chapter  to  represent  all  lines  of  visibility  between  two  diagonals  of  P.  Because 
shooting  is  a  visibility  problem,  only  open  hourglasses  are  important.  In  fact,  not 
all  edges  of  an  open  hourglass  are  used.  The  structure  of  [CG85]  stores  only  those 
hourglass  edges  that  affect  the  lines  of  visibility  passing  through  the  hourglass, 
that  is,  the  hourglass  edges  that  are  visible  from  both  ends  of  the  hourglass.  The 
shooting  structure  contains  exactly  the  same  hourglasses  as  our  0(log^  n)  query  data 
structure:  it  has  an  hourglass  corresponding  to  each  edge  in  the  factor  graph  S*. 
However,  the  hourglasses  axe  not  represented  in  the  same  way  as  ours.  Instead 
of  storing  only  the  new  tangent  edges  with  an  hourglass  formed  by  concatenating 
two  old  ones,  the  shooting  structure  stores  some  old  edges  with  the  new  hourglass. 
Each  hourglass  edge  is  allocated  to,  and  stored  only  at,  that  hourglass  in  S*  that 
has  it  as  an  edge  and  is  the  last  one  formed  during  the  construction  algorithm  of 
Section  6.3.1.  This  means  that  at  each  stage,  as  we  compute  the  hourglass  of  (di,  ds) 
from  the  hourglasses  of  (di,d2)  and  {3.2,  dz),  we  need  to  find  the  common  tangents 
and  then  split  the  old  hourglasses  where  the  common  tangents  touch  them.  The 
extremal  hourgleiss  pieces  get  joined  by  the  tangent  to  form  the  new  hourglass,  while 
the  inner  pieces  are  left  associated  with  the  old  hourglasses.  See  Figure  6.5(a)  for 
an  illustration. 

As  in  Section  6.3.1,  the  general  step  of  the  merging  process  concatenates  an 
hourglass  H  with  several  others.  If  some  edges  of  H  had  to  be  duplicated  to  appear 
in  more  than  one  of  the  resulting  hourglasses,  the  shooting  structure  might  be 
superlinear  in  size.  However,  the  restriction  that  hourglasses  contain  only  edges 
visible  from  both  boimding  diagonals  means  that  each  edge  of  H  belongs  to  at  most 
one  of  the  resultant  hourglasses,  so  this  problem  does  not  arise.  The  splitting  and 
joining  operations  can  be  implemented  in  time  of  the  same  order  of  magmtude  as 
that  of  the  common  tzmgent  computation,  if  a  balanced  tree  structure  is  used  to 
represent  the  hourglasses.  In  this  way,  continuing  exactly  as  in  [CG85],  the  entire 
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shooting  structure  can  be  computed  in  linear  time  and  space.  We  therefore  have 
the  following  result: 

Theorem  6.2.  Given  a  triangulated  simple  polygon  P  of  n  sides,  it  is  possible  to 
construct,  in  linear  time  and  space,  an  auxiliary  structure  with  which  the  shooting 
problem  for  P  can  be  solved  in  0(log  n)  time  per  query.  These  botmds  are  optimal. 


I  shot  an  arrow  into  the  air. 
It  fell  to  earth,  I  knew  not  where; 
For,  so  swiftly  it  flew,  the  sight 
Could  not  follow  it  in  its  Bight. 

—  Henry  Wadsworth  Longfellow,  “The  Arrow  and  the  Song” 


I  shot  an  arrow  into  the  air. 
It  fell  to  earth,  I  knew  not  where; 
Until  next  day,  with  rage  profound, 
The  man  it  fell  on  came  around. 

He  showed  me  where  that  arrow  fell 
In  less  time  than  it  takes  to  tell. 

And  now  I  do  not  greatly  care 
To  shoot  more  arrows  in  the  air. 


Anonymous 


Chapter  7 

Conclusions  and  Continuations 


In  my  end  is  my  beginning. 
—  T.  S.  Eliot,  Four  Quartets  (1943) 


In  this  chapter  we  discuss  possible  extensions  and  further  applications  of  the 
ideas  used  in  the  preceding  chapters. 

The  foundation  for  many  of  the  results  of  this  thesis  is  the  close  relationship 
between  shortest  path  and  visibility  problems.  This  connection  is  a  fundamental 
one,  similar  to  geometric  duality  [CGL85]  in  some  respects.  Algorithms  that  solve 
one  type  of  problem  often  have  solutions  to  the  other  type  as  easy  corollaries. 
Chapters  4  and  5  especially  illustrate  the  use  of  this  connection.  I  expect  this 
relationship  to  bear  fruit  in  future  work. 

A  theme  of  increasing  importance  in  computational  geometry  is  the  idea  of 
measuring  algorithmic  complexity  in  a  way  that  depends  on  the  data.  The  algorithm 
of  Chapter  2  builds  the  visibility  graph  of  segments  in  0(n^)  time  and  space.  When 
the  algorithm  is  used  to  solve  shortest  path  problems,  this  gives  a  0(n^)  shortest 
path  algorithm.  Algorithms  such  as  that  of  Reif  and  Storer  [RS85]  or  those  of 
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Chapters  4  and  6,  which  take  into  account  the  geonaetric  relationships  between 
the  segments,  can  dramatically  improve  this  bound.  As  noted  in  Chapter  2,  the 
visibility  graph  itself  can  be  as  small  as  0{n).  It  is  therefore  worthwhile  to  look 
for  algorithms  that  build  it  in  time  proportional  to  its  size.  Chapter  5  solves  the 
problem  for  the  special  ceise  in  which  the  segments  form  a  simple  polygon. 

Chapters  4  and  6  enlarge  the  collection  of  problems  solvable  in  linear  time  for 
triangulated  simple  polygons;  I  expect  many  additional  problems  for  such  polygons 
to  have  linear-time  solutions.  For  example,  Suri  [Sur86a]  has  recently  extended  the 
techniques  of  Chapter  4  to  solve  in  linear  time  the  fc-visibility  problem,  in  which, 
given  a  simple  polygon  P  and  an  edge  c  of  P,  we  wish  to  partition  P  into  disjoint 
subparts  Pi,  P3, . . .  such  that  Pi  contains  all  points  in  P  directly  visible  from  some 
point  on  c,  P2  contains  all  points  in  P  visible  from  some  point  in  Pi  but  not  from  e, 
and  so  on.  This  result  and  those  of  Chapters  4  through  6  emphasize  the  centrality  to 
plamar  computational  geometry  of  the  question  of  whether  there  exists  a  linear-time 
triangulation  algorithm  for  simple  polygons. 

Finally,  I  expect  the  techniques  of  Chapter  6  to  find  wider  application.  The 
decomposition  provided  by  Chazelle’s  polygon  cutting  theorem  [Cha82]  is  ideally 
suited  to  euiswering  queries  that  relate  one  part  of  a  simple  polygon  to  another, 
possibly  quite  distant  part.  I  believe  that  shortest  path  queries  and  shooting  queries 
are  only  the  first  two  query  types  discovered  that  fit  this  pattern.  The  edge  retrieval 
lists  of  Section  6.3.2  may  also  be  applicable  in  other  settings. 


Appendix  A 

Balanced  Decomposition  of  a 
Binary  Tree  in  Linear  Time 


The  woods  decay,  the  woods  decay  and  fall. 
—  Alfred,  Lord  Tennyson,  Tithonus 


This  appendix  shows  how  to  compute  the  balzinced  hierarchical  decomposition 
of  the  triangrJated  polygon  P  that  was  described  in  Section  6.1.  The  decomposition 
is  based  on  the  polygon  cutting  theorem  of  Chazelle,  which  states  that  Jiny  polygon 
has  a  diagonal  that  splits  it  into  two  subpolygons  of  roughly  equal  size  [Cha82].  The 
algorithm  presented  here  finds  a  decomposition  of  P  by  operating  on  the  tree  T  that 
is  dual  to  the  triangxilation  of  the  polygon.  Deleting  an  edge  of  T  splits  T  in  two 
and  corresponds  to  adding  a  partictilar  diagonal  to  P.  If  we  can  find  an  edge  whose 
removal  splits  T  into  subtrees  of  roughly  equal  size,  this  will  give  us  the  desired 
splitting  diagonal  for  P.  In  the  remainder  of  this  appendix  we  focus  our  attention 
on  decomposing  the  tree  T;  we  can  translate  a  decomposition  of  T  directly  into  a 
decomposition  of  the  polygon  P. 
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We  now  restate  some  of  these  ideas  formally.  Let  T  be  a  rooted  binary  tree 
with  n  nodes.  (When  T  is  the  dual  of  a  triangulation,  we  can  pick  any  node  with 
at  most  two  neighbors  to  be  the  root.)  If  an  edge  c  of  T  is  removed,  then  T  is 
partitioned  into  two  subtrees.  If  we  now  similarly  partition  each  of  these  subtrees 
and  continue  doing  this  recursively  until  the  fragments  left  are  single  nodes,  we 
obtain  another  tree  structure,  which  is  known  aa  a  decomposition  of  T.  Such  a 
decomposition  is  czdled  balanced  if  there  is  a  positive  constant  a  such  that  each 
time  a  subtree  F  is  partitioned  by  the  removal  of  an  edge,  each  of  the  two  fragments 
obtained  has  size  at  least  O'lJF’l,  where  |i^|  denotes  the  size  (number  of  nodes)  of  F. 
We  make  two  remarks  on  such  decompositions.  First,  it  is  clear  that  in  a  balanced 
decomposition  the  fragment  containing  a  particular  node  v  can  be  split  only  0(log  n) 
times.  Second,  it  is  known  that  in  any  binary  tree  there  is  an  edge  whose  removal 
leaves  two  components,  each  with  at  least  [(n  +  1)/3J  nodes.  There  is  always  such 
an  edge  incident  to  a  centroid  of  the  tree.  (If  we  define  the  weight  of  a  vertex  v  to  be 
the  size  of  the  largest  subtree  that  remains  after  v  and  its  incident  edges  are  deleted, 
a  centroid  is  a  vertex  of  minimum  weight.  There  are  at  most  two  centroids,  and 
if  there  are  two,  they  must  be  adjacent  [Knu73,  pages  387-388].)  A  splitting  edge 
incident  to  a  centroid,  called  a  centroid  edge,  can  be  fotmd  in  linear  time  [Cha82]. 
As  a  result,  we  can  find  a  balanced  decomposition  of  T  with  a  =  1/4  in  0(n  log  n) 
time  total  by  recursively  applying  the  centroid  partition  to  each  fragment.  Note 
that  1/ 4  is  just  a  lower  botmd  on  a.  In  general,  the  balance  parameter  will  be  close 
to  1/3:  for  n  >  4,  a  >  1/3  —  l/(3n). 

We  show  in  this  appendix  how  a  centroid  edge  for  partitioning  each  fragment 
can  be  computed  in  less  than  linear  time,  after  some  appropriate  data  structures 
have  been  set  up.  As  a  result,  we  are  able  to  reduce  the  overall  time  for  obtaining 
a  balanced  decomposition  to  0(n).  Our  method  makes  use  of  an  auxiliary  ternary 
tree  A^,  for  brevity  written  A,  to  facilitate  the  splitting.  The  auxiliary  tree  A  lets  us 
find  in  O(log^  n)  time  an  edge  at  which  T  may  be  split.  Moreover,  in  the  same  time 
bound,  we  can  break  A  into  two  auxiliary  trees,  one  for  eaoh  of  the  two  fragments 
of  T  resulting  from  the  split.  By  applying  this  process  recursively  we  obtain  the 
desired  balanced  decomposition  of  T. 
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The  remainder  of  this  appendix  has  three  parts.  The  first  part  defines  and 
characterizes  the  auxiliary  tree  A  associated  with  a  binary  tree  T.  Section  A.l 
shows  how  to  construct  the  auxiliary  tree  in  linear  time,  given  T  as  input.  Finally, 
Section  A.2  explains  how  to  use  the  auxiliary  tree  to  find  a  balanced  decomposition 
of  T. 

To  describe  our  approach  we  adopt  the  following  conventions:  The  notation  r„ 
refers  to  the  subtree  of  T  rooted  at  v.  The  descendants  of  v  include  v  itself.  Also, 
all  logarithms  are  base  two. 

The  auxiliary  tree  A  has  the  same  nodes  as  T,  but  different  edges.  It  provides  a 
way  to  search  for  a  node  in  T  without  traversing  many  edges  of  T.  An  analogy  will 
mahe  the  idea  clear:  If  we  are  given  an  ordered  list  and  want  to  have  fast  access  to 
its  elements,  we  can  build  a  symmetrically  ordered  balanced  binary  tree  with  the 
Ust  elements  stored  at  the  nodes.  The  tree  A  plays  the  same  role  for  T  as  the  binary 
search  tree  does  for  the  list;  we  now  compare  and  contrast  the  two  search  trees.  The 
height  of  A,  like  the  height  of  the  binary  tree,  is  proportional  to  the  logarithm  of 
its  size.  Because  each  element  in  the  list  case  has  at  most  two  neighbors,  a  binary 
search  tree  gives  fast  access  to  the  list;  each  node  in  T  has  up  to  three  neighbors, 
so  in  our  case  A  is  a  ternary  tree.  The  binary  tree  implements  a  form  of  binary 
search:  to  locate  a  list  element,  we  look  at  the  element  r  at  the  root  of  the  tree, 
decide  whether  the  desired  element  lies  to  the  right  or  left  of  r  (if  it  is  not  equal 
to  r),  and  recursively  search  in  one  of  the  subtrees  if  necessary.  The  ternary  tree  A 
facilitates  a  similar  search:  to  locate  a  particular  node  of  T,  we  look  at  the  node  a 
at  the  root  of  A,  decide  which  of  the  three  subtrees  of  T  incident  to  a  contains  the 
desired  node  (if  a  is  not  it),  and  recursively  search  in  the  auxiliary  tree  for  that 
subtree  of  T.  We  can  define  the  binary  search  tree  based  on  numbering  the  list 
elements  consecutively  from  1  to  n.  The  element  whose  number  is  divisible  by  the 
highest  power  of  two  goes  at  the  root  of  the  search  tree.  The  subtrees  for  its  left 
and  right  sublists  are  built  recursively  using  the  same  rule.  (See  Figure  A.l.)  The 
construction  of  A  described  below  is  based  on  the  same  idea,  but  the  munbering 
scheme  is  more  complicated.  Because  T  is  not  necessarily  a  linear  list,  we  cannot 
simply  number  its  nodes  from  1  to  n.  Instead,  we  assign  numbers  to  each  node  v 
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roughly  based  on  the  size  of  the  subtree  T^.  These  numbers  (the  labels  described 
below)  are  calculated  during  a  postorder  traversal  of  T. 

To  define  the  auxiliary  tree  A^  we  attach  pairs  of  integers  to  the  nodes  of  T.  We 
associate  with  each  node  t;  of  T  an  integer  index  and  an  integer  label  The 
index  is  the  height  of  u  in  A  and  also  equals  the  number  of  trailing  zeroes  in 
written  in  binary.  Formally,  by  is  defined  by  the  following  rules,  which  we  call  the 
carry  algorithm:  If  u  is  a  leaf  of  T,  then  6^  =  1.  If  u  has  only  one  child  in  T,  say  w, 
then  by  =  by,  +  l.  If  u  has  two  children  in  T,  say  w  and  z,  the  label  6„  is  constructed 
as  follows:  Let  i  be  the  bit  position  of  the  leftmost  carry  in  the  binary  computation 
of  +  6,  4- 1  (bits  are  numbered  right-to-left,  with  bit  0  as  the  rightmost  bit).  One 
can  easily  check  that  the  addition  order  does  not  matter.  If  no  carry  occurs,  as 
when  by,  =  4c  and  6*  =  2,  then  we  set  i  =  0,  as  if  the  added  1  were  a  carry.  The  bits 
of  by  are  equal  to  those  o{by,  +  bz  +  l  at  and  to  the  left  of  position  i.  To  the  right  of 
position  i  all  bits  of  by  are  0.  Note  that  since  no  carry  arises  to  the  left  of  position  t, 
the  bits  of  6^  left  of  position  i  can  be  obtained  by  adding  the  corresponding  portions 
of  by,  and  bf.  Note  also  that  the  i-th  bit  of  by  must  be  1,  and  so  is  equal  to  i. 

An  example  will  help  make  the  definition  clear.  Suppose  that  v  has  two  children 
w  and  z,  and  that  by,  =  45  and  bg  =  71.  Then  by  is  obtained  as  follows: 

leftmost  carry 

- ^ - > 

by,  =  45  =  (0101101)2 

bg  =  71  =  (1000111)2 

_ + _ 

by,  +  bg  +  l  =  117  =  (1110101)2 

by  =  112  =  (1110000)2 

Note  that  this  definition  also  applies  when  v  has  one  or  zero  children:  we  simply 
treat  the  labels  of  the  missing  children  as  zero. 

If  we  use  bitwise  logical  operations,  we  can  compute  by  in  constant  time.  How¬ 
ever,  bitwise  operations  are  not  necessary:  Section  A.l  shows  how  to  compute  labels 
for  all  the  nodes  of  T  in  linear  time  without  using  bitwise  operations. 

When  T  is  a  path  with  the  root  at  one  end,  as  in  Figure  A.l,  the  labels  run 
consecutively  from  1  to  n,  and  the  indices  give  the  “ruler  function.”  A  key  property 
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of  the  ruler  function  is  that  two  nodes  with  equal  indices  are  separated  by  a  node 
with  a  higher  index.  We  say  that  the  indices  assigned  by  the  ruler  function  have 
the  separation  property: 

The  separation  property:  If  for  v  ^w,  then  there  is  a  node  u  on 

the  path  from  v  to  w  with  iu>  iv 

Note  that  except  for  notational  differences,  the  separation  property  is  identical  to 
the  lacuna  property  used  in  Chapter  6.  We  show  below  that  the  indices  assigned 
by  the  carry  algorithm  have  the  separation  property  not  just  when  T  is  a  path,  but 
also  when  it  is  an  arbitrary  binary  tree.  However,  before  we  prove  that  the  indices 
have  the  separation  property,  we  show  why  the  separation  property  is  desirable. 

The  separation  property  is  closely  related  to  decomposition  by  vertex-deletion. 
The  following  lemma  makes  the  connection  between  separation  and  decomposition: 

Lemma  A.l.  Let  T  be  a  binary  tree  whose  nodes  have  been  assigned  indices  that 
satisfy  the  separation  property.  If  all  nodes  with  indices  greater  than  some  positive  j 
are  removed  from  T  along  with  their  incident  edges,  each  remaining  subtree  of  T 
has  exactly  one  node  with  maximum  index. 

Proof;  We  prove  the  lemma  by  proving  a  stronger  result:  If  we  break  T 
into  subtrees  by  deleting  any  collection  of  edges,  each  subtree  has  a 
unique  node  with  maximum  index.  Suppose  to  the  contrary  that  some 
subtree  has  two  nodes  v  and  w  with  maximmn  index.  By  the  separation 
property  there  is  a  node  u  on  the  path  between  them  with  >  iv.  Every 
node  on  the  path  belongs  to  the  subtree,  so  cannot  be  the  maximum 
index.  I 

Because  the  indices  assigned  by  the  carry  algorithm  have  the  separation  prop¬ 
erty,  we  can  use  node  indices  to  build  the  auxiliary  tree  as  follows:  The  single  node  a 
with  highest  index  is  the  root  of  A.  Removal  of  this  node  and  its  incident  edges 
from  T  generates  at  most  three  subtrees  in  T,  which  recursively  define  the  (at  most) 
three  subtrees  in  A  of  the  root  node.  We  can  use  the  resulting  tree  A  to  search  in  T 
because  the  nodes  of  any  subtree  Av  a.re  contiguous  in  T.  Figures  A.l  and  A.2  give 
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two  examples  of  the  indices  and  labels  assigned  by  the  carry  algorithm  eind  their 
use  in  building  the  auxiliary  tree  A. 


8(3) 


Figure  A.l.  The  nodes  of  the  upper  tree  (a  single  path)  are  labelled 
with  The  indices  give  the  “ruler  function.”  The  lower  tree  is 

the  auxiliary  tree  built  using  the  labels. 

We  now  show  that  the  indices  assigned  by  the  carry  algorithm  have  the  separa¬ 
tion  property.  To  do  this,  we  attach  an  interpretation  to  the  labels  and  indices.  We 
begin  by  treating  6^  as  a  bit  vector:  b„  =  where  bv[j]  =  0  or  1.  If  u  is 

a  leaf  of  T,  then  b„  =  1;  that  is,  6„[0]  =  1  and  5„[;]  =  0  for  j  >  0.  The  index  is  0. 
If  V  has  children  w  and  z,  then  the  “leftmost  carry”  definition  of  is  equivalent  to 
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Figure  A.2.  The  nodes  of  the  upper  tree  (an  unbalanced  binary  tree) 
are  labelled  with  The  lower  tree  is  the  auxiliary  tree  built  using 

the  labels. 
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the  following  definition: 

iv  =  min{j  >  0  |  bru\j]  =  6,[j]  =  0  and  b^[k]  •  6^[A:]  =  0  for  all  fc  >  j}.  (A.l) 

r 

(If  V  has  only  one  child,  say  w,  then  bz[j]  =  6j[A:]  =  0  in  this  expression.)  Once  is 
known,  b^,  has  a  simple  definition: 

{brv\j]  +  bz\j]  i£j>iv 

1  if  j  =  iv  (A.2) 

0  if  j  <  it,. 

To  characterize  the  labels  and  indices  we  introduce  the  idea  of  path  indices.  The 
index  of  a  path  in  T  is  defined  to  be  the  maximum  index  of  all  interior  nodes  on  the 
path  (that  is,  excluding  the  two  end  nodes  of  the  path),  or  —1  if  the  path  contains 
no  interior  nodes  (it  consists  of  a  single  edge  or  even  of  a  single  vertex).  We  define 
ind(u,v)  to  be  the  index  of  the  path  joining  two  nodes  u  and  v.  The  separation 
property  can.  be  restated  in  terms  of  path  indices  as  follows:  Any  two  nodes  of 
equal  index  are  connected  by  a  path  with  greater  index.  The  following  lemma 
characterizes  the  labels  using  path  indices.  The  lemma  refers  to  descendants 
of  V]  recall  that  the  descendants  of  v  include  v  itself. 

Lemma  A.2.  Let  T  be  a  binary  tree  labelled  by  the  the  carry  algorithm.  For  each 
node  V  and  each  j  >  v  has  at  most  one  descendant  u  such  that  =  j  and 
ind(u,v)  <  j.  Furthermore,  b„\j\  =  1  if  and  only  if  such  a  descendant  exists. 

Proof:  The  proof  is  by  induction  on  the  height  of  v.  When  u  is  a 
leaf,  =  1  and  =  0.  The  only  descendant  of  u  is  u  itself,  and  the 
path  index  tnd(v,v)  is  —1,  which  is  less  than  To  prove  the  second 
statement,  note  that  b„[j]  =  1  only  for  j  =  i„  =  0. 

Now  suppose  that  v  has  two  children,  w  and  z.  We  consider  bv\j] 
for  j  >  i„.  Whenever  6,„[;]  =  6*[;]  =  0  for  j  >  i^,  v  has  no  strict 
descendant  u  with  =  j  and  ind{v,u)  <  j.  To  see  this,  let  u  be  a 
descendant  of  w  with  =  j;  because  =  0,  we  have  u  ^  w  and  j  ^ 
i,o.  Hence  ind{v,u)  =  max(i,o,*nd(w,u)).  lfj<  we  have  ind(v,u)  > 
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ivj  >  j.  On  the  other  hand,  if  j  >  the  induction  hypothesis  implies 
that  ind(WfU)  >  j,  and  hence  ind(v,u)  >  j. 

Whenever  K\j]  =  0  for  j  >  equation  A.2  above  implies  that 
bw[j]  =  K[j]  =  0.  Therefore  v  has  no  descendant  u  such  that  =  j  and 
ind(v,u)  <  j.  1£  j  =  *»,  we  have  6„[;]  =  1,  but  bu,[j]  =  bz\j]  =  0.  In  this 
case  t;  has  no  strict  descendant  u  with  =  t'v  and  ind(u,v)  <  i„,  but  v 
itself  is  a  descendant  of  v  that  meets  these  conditions. 

K  ftvb]  —  1  some  j  >  then  exactly  one  of  &«,[/]  and  hz\j] 
is  1;  without  loss  of  generality  suppose  =  1.  By  induction,  ev¬ 
ery  descendant  x  ot  z  with  t*  =  j  has  ind{z,x)  >  j,  and  therefore 
ind(v,x)  >  j.  Also  by  induction,  exactly  one  descendant  «  of  it;  has 
iu  =  j  and  ind{w,u)  <  j.  If  u  =  it>,  then  ind(v,u)  =  —1,  which  is 
less  than  j.  On  the  other  hand,  if  u  w,  then  j  —  >  i^,  and 

ind(v,  u)  =s  max(tw,  ind{w,  u))  <  j.  In  either  case  ind{v,  u)  <  j.  Node  u 
is  the  only  descendant  of  v  with  t'a  =  j  and  ind(v,«)  <  j. 

If  V  has  only  one  child,  say  u>,  the  definition  of  in  equation  A.l 
reduces  to  t„  =  min  >  0  |  bj[j\  —  0}.  The  arguments  needed  to  prove 
the  lemma  in  this  case  are  similar  to  those  used  above  and  are  left  to 
the  reader.  | 

We  use  the  preceding  lemma  to  show  that  the  indices  produced  by  the  carry 
algorithm  have  the  separation  property. 

Lemma  A. 3.  Let  T  be  a  binary  tree  labelled  using  the  carry  algorithm.  The 
indices  of  the  nodes  of  T  have  the  separation  property. 

Proof;  Let  x  and  y  be  two  nodes  of  T  such  that  =  iy.  The  proof 
splits  into  two  ceises  depending  on  whether  one  of  the  two  nodes  is  an 
ancestor  of  the  other. 

First  suppose  that  one  of  the  nodes,  say  x,  is  an  ancestor  of  the  other. 

If  any  interior  nodes  on  the  path  joining  x  and  y  have  index  ix,  let  u  be 
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the  one  closest  to  x;  otherwise  let  u  —  y.  Because  ind{x,x)  =  —1  is  less 
than  ix,  Lemma  A.2  implies  that  ind{x,u)  is  at  least  i*.  Since  no  node 
with  index  ix  lies  strictly  between  x  and  «,  we  have  ind{x,  u)  >  ix,  and 
hence  ind{x,y)  >  ix- 

If  neither  x  nor  y  is  an  ancestor  of  the  other,  let  v  be  their  lowest  com¬ 
mon  ancestor.  We  can  write  ind(x,y)  =  max(tn(i(x,t>),i„,  tnd(y,  u)).  If 
iv  >  is,  we  axe  done.  If  =  i*,  the  argument  of  the  first  case  shows 
that  ind{x,  v)  >  ix-  If  iv  <  ix,  we  show  that  at  least  one  of  ind(x,  v)  and 
ind(y,  v)  is  greater  than  i*.  Let  w  and  z  be  the  children  of  v,  with  w  an 
ancestor  of  x  and  z  an  ancestor  of  y.  Since  i*  >  iy,  at  most  one  of 
and  hz{ix]  is  1;  without  loss  of  generality  suppose  that  6u,[iar]  =  0.  For 
any  node  u  in  with  I'u  =  ix,  we  have  ind{w,u)  >  ix.  An  argument 
similar  to  the  one  above  implies  that  ind{w,  u)  >  ix  for  any  such  node  u. 
Therefore  ind{x,  w)  >  ix.  The  path  fi:om  x  to  w  is  a  subpath  of  the  one 
from  X  to  y,  and  so  ind{x,y)  >  ix.  I 

The  next  lemma  proves  two  more  facts  about  the  labels  produced  by  the  carry 
edgorithm.  It  shows  that  the  decomposition  represented  by  A  is  height-bounded, 
although  not  necessarily  balanced. 

Lemma  A.4.  Let  T  be  a  binary  tree  labelled  by  the  carry  algorithm.  Then 

(a)  for  each  node  v,  \Tv\  >  K,  and 

(b)  for  each  index  j,  there  are  at  most  L|T|/2^J  nodes  with  that  index  in  T. 

Proof:  The  proofs  of  both  statements  are  inductive.  The  first  follows 
easily  from  the  carry  definition  of  by.  For  a  leaf  v,  [Tvl  =  6^  =  1*  Since 
by  <  by,  +  bx  +  1,  induction  implies  that  |T„|  =  ITu,]  -|-  \Tz\  +  l>by. 

To  prove  (b),  let  us  define  D(v)  to  be  the  set  of  the  descendants  of  v 
whose  indices  are  at  most  iy  and  which  are  joined  to  v  by  paths  of  index 
less  than  Because  of  the  separation  property,  the  sets  D(v)  and  D(w) 
are  disjoint  for  distinct  v  and  w  with  iy  =  iy,:  If  TyCiTy,  is  empty,  then  so 
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is  D(v)nD(w).  If  TvHT^  is  nonempty,  without  loss  of  generality  assume 
that  t;  is  an  ancestor  of  w.  By  the  separation  property,  ind{v,w)  >  i„, 
and  hence  ind{v,  u)  >  for  any  u  €  D{w). 

We  next  show  by  an  inductive  argument  that  |I?(v)|  >  2**’.  This  is 
clearly  true  for  v  a  leaf:  |i?(v)|  =  1  >  2®.  If  u  has  children  w  and  z, 
then  a  node  q  is  in  D{v)  if  either  g  =  u  or  5  is  a  descendant  of  w  or 
of  z,  say  for  definiteness  a  descendant  of  w,  such  that  i,  <  iw  ^ 
and  ind{w,  q)  <  t'v.  In  this  latter  case,  let  j  <  iy  be  the  maximum  node 
index  along  the  path  from  w  to  q  (including  these  two  nodes),  and  let  u 
be  the  unique  node  along  this  path  with  index  j.  Then  clearly  q  €  D{u) 
and  by,\j]  =  1.  It  is  also  easy  to  check  the  converse  statement,  namely 
that  D{v)  contains  each  set  D(u)  for  a  descendant  u  of  ti;  or  of  z  that 
causes  by,[j]  or  bz\j]  to  be  1  for  any  j  <  i„.  But  the  sets  D{u)  are  all 
disjoint.  Suppose  to  the  contrary  that  q  €  D{u)  fl  D{u')\  then  without 
loss  of  generality  we  can  assiune  that  «  is  a  strict  descendant  of  u*  and 
both  are  descendants  of  w.  If  <  *„»,  then  u  cannot  have  caused  6u,[iu] 
to  be  1,  and  if  then  q  cannot  belong  to  D(u').  Thus,  by  the 

induction  hypothesis, 

P(v)l  ^  1  +  K[j])  > 

j<iv 

(because  1  +  &«,  +  6*  has  a  carry  at  the  i^-th  bit);  and,  since  all  the 
sets  D{y)  for  nodes  with  the  same  index  are  disjoint,  (b)  follows.  | 

Since  ia  is  the  height  of  A,  part  (b)  of  the  preceding  lemma  shows  that  A  has 
height  at  most  [lognj. 


A.l  Constructing  the  auxiliary  tree  A 

It  is  possible  to  determine  the  indices  of  the  nodes  and  build  A  dmring  a  single 
postorder  (depth-first)  traversal  of  T,  The  two  trees  T  and  A  have  the  same  node 
set.  To  distinguish  the  edge  sets,  we  speak  of  the  edges  of  T  and  of  the  Jinks  of  A. 
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For  each  label  6„,  the  construction  requires  a  vector  p„  of  pointers  to  nodes.  If  by[j] 
is  1,  then  Pv[j]  points  to  the  (unique)  descendant  of  u  in  T  that  has  index  j  and 
is  reachable  by  a  path  of  index  less  than  j.  The  stack  implicit  in  the  depth-first 
search  defines  a  path  tt  from  the  root  of  T  to  the  current  node.  The  construction 
maintains  by  and  pv  for  each  node  v  that  is  a  child  of  a  node  on  tt  but  is  not  on  tt 
itself.  Each  such  v  is  the  root  of  a  subtree  Ty.  The  vectors  by  and  pv  take  O(log  ir„|) 
space.  Since  the  subtrees  Ty  8ire  all  disjoint,  the  total  space  taken  by  all  the  vectors 
by  and  p„  is  linear. 

When  the  postorder  traversal  visits  a  node  v,  the  algorithm  constructs  by  and  py 
from  the  vectors  stored  at  the  children  w  and  z  ol  v.  At  the  same  time,  it  builds 
auxiliary  tree  links  for  nodes  that  appear  in  py,  and  p,  but  not  in  py.  Such  a  node 
(suppose  it  is  u  in  Ty,)  has  an  index  less  than  iy,  so  its  parent  in  A  is  either  u  or  a 
node  of  Ty,.  In  faet,  its  parent  in  A  is  the  node  with  minimal  index  j  >  iy  reachable 
from  u  by  a  path  in  T  of  index  less  than  iy.  These  observations  imply  that  if  j  is 
the  largest  integer  less  than  iy  such  that  by,\j]  =  1,  then  v  is  the  parent  of  py,\j]  in  A. 
Similarly,  if  j  and  k  are  integers  less  than  iy  such  that  j  <  k,  by,\j]  =  6,„[fc]  =  1,  and 
by,[l]  =  0  for  all  I  strictly  between  j  and  k,  then  py,[k]  is  the  parent  of  py,\j]  in  A. 
Linking  these  nodes  to  their  parents  in  A  takes  time  proportional  to  iy. 

When  the  algorithm  reaches  the  root  t  of  T,  it  links  the  nodes  pointed  to  by  pt 
as  if  <  were  the  child  of  a  node  with  index  greater  than  that  of  any  node  in  T.  The 
root  a  of  A  is  the  highest-indexed  node  appearing  in  p*. 

Even  without  logical  operations  on  the  by  vectors,  the  construction  of  A  requires 
only  linear  time.  When  the  traversal  visits  v  (with  children  w  and  z),  the  algorithm 
takes  time  proportional  to  the  number  of  links  made  plus  the  logarithm  of  the 
smaller  of  ITu/l  and  JT^I,  which  gives  a  linear  overall  bound.  In  linear  additional 
time,  a  traversal  of  A  can  be  used  to  compute  |A„|  and  store  it  at  each  node  u;  this 
information  is  needed  in  order  to  split  the  auxiliary  tree  later  on. 
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A. 2  Decomposing  the  tree  T 

Once  wfr  have  constructed  the  auxiliary  tree  A,  only  a  linear  amoxint  of  additional 
work  is  needed  to  find  a  balanced  decomposition  of  T.  The  algorithm  uses  A  to 
split  T  into  balanced  subtrees,  then  builds  an  auxiliary  tree  for  each  fragment,  and 
finally  decomposes  each  fragment  recursively.  The  general  step  of  the  algorithm  has 
two  parts,  which  we  describe  in  turn:  The  first  step  uses  the  auxilieiry  tree  to  find 
an  edge  of  T  whose  removal  splits  T  into  pieces  of  nearly  equal  size.  The  second 
step  splits  A  into  two  new  auxiliary  trees,  one  for  each  piece  of  T. 

The  algorithm  finds  a  splitting  edge  for  T  in  time  proportional  to  the  square  of 
the  height  of  A.  This  edge  is  incident  to  the  centroid  of  T;  we  call  the  edge  Cc.  The 
algorithm  finds  Cc  by  first  finding  the  centroid  of  T  (one  of  the  centroids  if  there 
are  two),  then  picking  the  edge  incident  to  it  whose  deletion  from  T  would  leave 
fragments  of  most  nearly  equal  size. 

Before  we  describe  the  search  algorithm,  we  describe  one  of  its  component  oper¬ 
ations,  finding  the  relative  locations  of  two  nodes  of  T.  Removing  a  node  v  from  T 
leaves  at  most  three  subtrees.  Given  a  query  node  w  not  equal  to  v,  we  want  to 
determine  quickly  which  subtree  contains  w.  If  we  compute  preorder  and  postorder 
numbers  for  the  nodes  of  T  at  the  same  time  as  we  compute  the  labels  and  indices, 
we  can  answer  such  queries  in  constant  time.  For  any  two  nodes  x  and  y,  r  is  an 
ancestor  of  y  if  and  only  if  PTe(x)  <  Pre{y)  and  Post{x)  >  Post(y).  The  query 
node  w  is  in  the  left  (right)  subtree  of  v  if  and  only  if  it  is  a  descendant  of  the  left 
(right)  child  of  u.  If  u  is  not  an  ancestor  of  w,  then  w  lies  in  the  third  subtree,  the 
“up”  subtree  of  v. 

To  find  the  centroid  of  T,  we  search  in  A  using  the  subtree  sizes  |>1„|.  Deleting 
a  node  v  and  its  incident  edges  in  T  splits  T  into  at  most  three  trees  L,  R,  and  U. 
If  the  centroid  search  visits  u,  it  must  determine  the  sizes  of  L,  i2,  and  17.  If  all  the 
sizes  are  at  most  |T|/2,  then  v  is  the  centroid.  If  one  subtree  is  larger  than  |T'|/2, 
then  the  centroid  lies  in  A^u,  where  w  is  the  child  of  u  in  A  that  belongs  to  the  large 
subtree.  (This  is  easy  to  prove  inductively.)  When  the  search  descends  from  v  to  w, 
it  eliminates  from  consideration  the  nodes  of  A„  not  in  Au,^  We  associate  this  set 
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with  V  and  call  it  elim{v).  The  size  of  this  set,  |eZtm(v)|,  is  |Av|  —  |-Au,|. 

We  use  the  eliminated  sets  to  find  the  sizes  of  L,  R,  and  U.  The  eliminated 
sets  associated  with  the  strict  ancestors  of  v  in  A,  along  with  with  A„,  partition 
the  nodes  of  T  into  disjoint  sets  of  contiguous  nodes.  For  each  node  u  that  is  an 
ancestor  of  v  in  A,  elim{u)  lies  in  exactly  one  of  X,  il,  and  U.  Let  x  be  a  child  of  v 
in  A,  and  suppose  that  x  lies  in  L.  Then  the  size  of  L  is  just  \Ax\  plus  |e/m(tt)| 
for  each  u  in  X  that  is  a  strict  ancestor  of  v  in  A.  (Using  preorder  and  postorder 
numbers,  we  can  determine  in  constant  time  whether  a  node  lies  in  X.)  The  search 
time  at  v  is  proportional  to  the  nixmber  of  ancestors  of  v  in  A‘,  the  total  time  to 
find  a  centroid  is  proportional  to  the  square  of  the  height  of  A.  Once  we  have  foimd 
a  centroid  c,  the  desired  splitting  edge  is  the  one  that  connects  c  to  the  largest 
subtree  that  would  remain  if  c  were  deleted. 

Deleting  Cc  from  T  results  in  two  subtrees  R  and  B  of  roughly  equal  size.  For  ease 
of  exposition,  let  us  assume  that  the  nodes  of  these  two  subtrees  are  painted  red  emd 
black,  respectively.  To  allow  recursive  splitting,  R  and  B  must  have  auxiliary  trees 
A^  and  A^  built  for  them.  The  indices  of  nodes  in  R  and  B  satisfy  the  separation 
property,  and  so  we  can  define  the  auxiliary  trees  A^  and  A^  just  as  A  was  defined 
on  page  156.  That  is,  the  root  r  of  A^  is  the  node  in  R  with  maximum  index,  and 
its  subtrees  are  the  auxiliary  trees  for  the  fragments  of  R  created  by  deleting  r;  A^ 
is  defined  similarly.  These  auxiliary  trees  do  not  have  the  structural  characteristics 
described  in  Lemmas  A.2  and  A.4,  but  they  are  quite  adequate  for  searching  in  R 
and  B.  We  create  and  A^  by  modifying  A;  because  the  construction  does  not 
start  from  scratch,  it  takes  time  proportional  to  the  height  of  A,  rather  than  to  |A|. 

The  construction  of  A^  and  A®  is  based  on  the  following  lemma: 

Lemma  A.5.  The  parent  in  A®  or  A®  of  a  node  v  is  its  nearest  strict  ancestor 
in  A  with  the  same  color. 

Proof:  Suppose  without  loss  of  generality  that  v  is  red.  Consider  delet¬ 
ing  the  nodes  of  R  in  decreasing  index  order.  Let  u  be  the  node  whose 
deletion  leaves  v  the  node  of  maximum  index  in  its  component,  and 
let  C®  be  the  component  in  which  v  has  maximum  index.  Then  u  is  the 
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parent  of  v  in  A^.  Define  C  to  be  the  component  of  T  that  contains  v 
after  ail  nodes  with  indices  at  least  are  deleted.  The  nodes  of  C  be¬ 
long  to  j4„,  so  u  is  an  ancestor  of  v  in  A.  The  interior  nodes  of  the  path 
in  A  from  u  to  u  lie  in  C  but  not  in  C^,  so  they  must  all  be  black.  | 

Splitting  A  to  form  the  two  new  auxiliary  trees  is  relatively  straightforward.  By 
Lemma  A.l,  each  of  the  subtrees  R  and  B  has  a  tinique  node  with  maximum  index. 
These  nodes,  one  of  which  is  a,  are  the  roots  of  A^  and  A^.  Let  v  be  the  endpoint 
of  Cc  with  smaller  index.  The  path  tt  in  A  from  a  to  v  includes  the  other  endpoint 
of  Cc.  Because  the  nodes  are  augmented  with  their  preorder  and  postorder  numbers 
in  T,  a  constant-time  test  can  determine  the  color  of  a  node.  (If  /  is  the  endpoint 
of  Cc  that  is  lower  in  T,  then  T/  contains  all  the  nodes  of  one  color  and  none  of  the 
other  color.  Note  that  the  definitions  of  I  and  v  are  unrelated.)  The  root  of  A^ 
is  the  highest  red  node  on  tt;  the  root  of  A^  is  the  highest  black  node.  The  new 
auxiliju^r  trees  are  constructed  by  dividing  ir  into  two  monochromatic  paths.  In  A^ 
and  A^,  every  node  w  onir  has  as  its  successor  the  next  node  on  tt  with  the  same 
color  as  w.  The  subtree  sizes  {Aj.]  are  still  valid  in  A^  and  A^  except  at  nodes  x 
on  TT,  where  they  must  be  recomputed.  These  changes  take  time  proportional  to 
the  length  of  the  path  it.  To  see  that  these  modifications  of  A  are  sufficient,  observe 
that  at  most  one  tree  Ay  is  dichromatic  for  nodes  u  of  a  given  depth  in  A  (because 
a  dichromatic  tree  includes  both  ends  of  Cc),  and  that  after  v  is  modified,  each  node 
has  atixiliary  tree  links  only  to  nodes  of  its  own  color. 

To  analyze  the  cost  of  this  construction,  we  note  that  auxiliary  tree  nodes  cannot 
increase  in  height  as  the  decomposition  proceeds,  and  that  the  total  number  of  nodes 
in  A  of  height  k  is  at  most  n/2*.  The  cost  of  splitting  an  auxiliary  tree  whose  root 
has  height  k  is  0{k^).  Furthermore,  a  node  v  with  height  k  in  A  appears  at  the 
root  of  an  auxiliary  tree  at  most  0{k)  times  total  during  the  decomposition,  since 
it  takes  only  0{k)  splittings  to  reduce  the  size  of  the  auxiliary  tree  rooted  at  v  from 
0(3*)  to  1.  Therefore  the  whole  decomposition  takes  time 

UoK"J  „ 

A:=0  ^ 

The  preceding  discussion  constitutes  a  proof  of  the  following  theorem: 
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Theorem  A.l.  It  is  possible  to  6nd  a  balanced  decomposition  of  an  arbitrary 
binary  tree  in  linear  time. 

As  noted  at  the  beginning  of  the  appendix,  this  linear  algorithm  for  decomposing 
an  arbitrary  binary  tree  also  provides  a  linear-time  method  for  finding  a  balanced 
decomposition  of  a  triangulated  simple  polygon. 

Remark:  This  algorithm  can  easily  be  extended  to  decompose  trees  in  which  each 
node  has  botmded  degree. 

Remark:  An  alternative  0(n)-time  technique  for  finding  a  balanced  tree  decom¬ 
position  is  obtained  by  using  a  simplified  version  of  the  dynamic  tree  data  structure 
(as  described  in  [Tar83,  Ch.  5]),  which  supports  logarithmic-cost  tree-splitting  op¬ 
erations. 

You  have  endured  worse  things, 
God  will  grant  an  end  even  to  these. 


—  Virgil,  Aeneid,  I.  199 
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